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SC1A9 29400 bp DNA BCT ll-DEC-1998 

Streptomyces coelicolor cosmid 1A9 . 

AL034446 

AL034446. 1 GI : 4007685 

ADA-like regulatory protein; araC family; gntR; helix-turn-helix; 
His rich; integral membrane; luxR; luxR family; 
methylated-DNA-protein-cysteine methyl transferase; nucleotide 
binding; ogt; oxidoreductase; poxB; pyruvate oxidase; response 
regulator; sensor kinase; sorbitol oxid ase; thiamine 
pyrophosphate; TPP; transferase; transmembrane protein; two 
component system. 
Streptomyces coelicolor A3 (2). 
Streptomyces coelicolor A3 (2) 

Bacteria; Firmicutes; Act inobacteria ; Actinobacteridae ; 
Actinomycetales; Streptomycineae ; Streptomycetaceae ; Streptomyces . 

1 (bases 1 to 29400) 
Saunders, D. C. and Harris, D. 
Unpublished 

2 (bases 1 to 29400) 

Bentley, S.D. , Parkhill,J., Barrell,B.G. and Ra j andream, M . A . 
Direct Submission 

Submitted (04-DEC-1998 ) Streptomyces coelicolor sequencing project, 
Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge 
CB10 ISA E-mail: barrell@sanger.ac.uk Cosmids supplied by Prof. 
David A. Hopwood, [3] John Innes Centre, Norwich Research Park, 
Colney, Norwich, Norfolk NR4 7UH, UK 

3 (bases 1 to 29400) 

Redenbach,M. , Kieser,H.M., Denapaite, D. , Eichner,A., Cullum, J., 
Kinashi,H. and Hopwood, D. A. 

A set of ordered cosmids and a detailed genetic and physical map 

for the 8 Mb Streptomyces coelicolor A3 (2) chromosome 

Mol. Microbiol. 21 (1), 77-96 (1996) 

97000351 

Notes : 

Streptomyces coelicolor sequencing at The Sanger Centre is funded 
by the BBSRC. 

Details of S. coelicolor sequencing at the Sanger Centre are 
available on the World Wide Web. 

(URL; http://www.sanger.ac.uk/Projects/S_coelicolor/) CDS are 
numbered using the following system eg SC7B7.01c. SC (S. 
coelicolor), 7B7 (cosmid name), .01 (first CDS), c (complementary 
strand) . 

The more significant matches with motifs in the PROSITE database 
are also included but some of these may be fortuitous. The length 
in codons is given for each CDS. 

Usually the highest scoring match found by fasta -o is given for 
CDS which show significant similarity to other CDS in the database. 
The position of possible ribosome binding site sequences are given 
where these have been used to deduce the initiation codon . Gene 
prediction is based on positional base preference in codons using a 
specially developed Hidden Markov Model (Krogh et al . , Nucleic 
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Acids Research, 22 (22 ): 4768-4778 ( 1994 ) ) and the FramePlot program 
of Bibb et al., Gene 30:157-66(1984} as implemented at 
http : //www. nih . go . jp/ 

jun/cgi-bin/f rameplot.pl. CAUTION : We may not have predicted the 
correct initiation codon. Where possible we choose an initiation 
codon (atg, gtg, ttg or (att)) which is preceded by an upstream 
ribosome binding site sequence (optimally 5-13bp before the 
initiation codon) . If this cannot be identified we choose the most 
upstream initiation codon. 

IMPORTANT: This sequence MAY NOT be the entire insert of the 
sequenced clone. It may be shorter because we only sequence 
overlapping sections once, or longer, because we arrange for a 
small overlap between neighbouring submissions. Cosmid 1A9 lies 
between 9B2 and 6C5 in the Ase-I-B genomic restriction fragment. 
FEATURES Location/Qualifiers 
source 1 . .29400 

/organism="Streptomyces coelicolor A3(2) ,T 
/strain="A3 (2) " 
/db_xref="taxon: 100226" 
/clone="cosmid 1A9" 
gene complement (1 . .651) 

/gene="SClA9.01c" 
CDS complement (<1 . . 651) 

/gene="SClA9.01c" 

/note="SClA9. 01c, incomplete CDS, possible transferase, 
partial CDS, len: 219 aa, similar to TR:053185 
(EMBL:AL021246) a proposed transferase from Mycobacterium 
tuberculosis (302 aa) , fasta scores: opt: 764, z-score: 
1097.3, E(): 0, (62.1% identity in 203 aa overlap {302 
aa) . Also weakly similar to several methyltransf erases eg. 
METH_ECOLI 

(EMBL:X16584) 5-methyltetrahydrof olate-homocysteine 
methyltransferase (1226 aa), fasta scores; opt: 160, 
z-score: 234.8, E(): 8.4e-06, (30.0% identity in 220 aa 
overlap) . " 
/codon_start=l 
/transl table=ll 
/product = "putative transferase" 
/protein id=" CAA2237 1 . 1 " 
/db_xref="GI : 4007686" 

/translation="MTSDFADALASGPLVLDGGLSNQLEAAGHDLGDALWSARLLAED 
PEAITRAHLAYFEAGAEVAITSSYQATFEGFARRGIGRERAAELLALSVASAREAARR 
ARTARPERALWVAASAGPYGAMLADGSEYRGRYGLGRGALERFHRPRLEVLAAARPDV 
LALETVPDTDEAAALLRAVRGLDVPAWLSYTVAGDRTRAGQPLDEAFALAADVDEVI" 
gene 731. .1654 

/gene="SClA9. 02" 
CDS 731. .1654 

/gene="SClA9. 02" 

/note="SClA9. 02, possible transmembrane protein, len: 
307aa; Contains several possible membrane spanning 
domains . " 
/codon_start=l 
/transl table=ll 

/product="putative transmembrane protein" 
/protein id=" CAA22372 . 1 " 
/db_xref ="GI : 4007687" 

/ trans la t ion- "MTISGRRIRSVRCSPRHSSHGPDEGVPVRFVWQFLAVLVAYAIG 
GIAVQAVKDNDWLTLVVGLTSVALVVFVYAWVVRRTERREALDVALDGAAAKAGWGTL 
IGFGLFGAVITNLFASGYYEVDGLGSVQGAIGLVGFMAAAAATEEVVFRGVLFRI IEE 
HIGTYLALGLTGLVFGLMHLLNEDATLWGALAIAIEAG FMLAAAYAATRNLWLTIGVH 
FGWNFAAGGVFSTVVSGNGDSEGLLDATMSGPKLLTGGDFGPEGSVYSVGFGVLLTLV 
FLWLAHRRGNIVAFGSRRRAAGANSAATLPR" 

qene 1651. .2817 

/gene="SClA9. 03" 

CDS 1651.. 2817 
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/gene="SClA9.03" 

/note="SClA9. 03, probable two component sensor kinase, 
len: 388aa; similar to a family of sensor kinases egs . 
TR:Q53893 (EMBL: U51332 ) AbsAl from Streptomyces coelicolor 
(571 aa), fasta scores; opt: 389, z-score: 278.9, E(): 
2.9e-08, (33.1% identity in 405 aa overlap) and UHPB^ECOLI 
(EMBL:M17102) sensor kinase from Escherichia coli (500 
aa), fasta scores; opt:221, z-score: 248.0, E(): 1.5e-06, 
(25.9% identity in 398 aa overlap). Contains several 
possible membrane spanning domains." 
/codon_start=l 
/transl_table=ll 

/product="putative sensor kinase" 
/protein id=" CAA22373 . 1 " 
/db_xref="GI: 4007688" 

/translation="MIDRRRVLELWRRLDVTVRDLPLGVLLLVASLLPSLRGQGTEIG 
GLPTRPADALAGVAAVLQSI PLAVRRRWTLLCLTLVSLGFALDQLRAYHLFAGAALPI 
VLINAGSHQEKYRRATQVTATLGYVAMAVGLNARGGDETLVEYVTFYLVLALAWGIGA 
WMRSARAAEAERRSRVAEDARNAERTRIARELHDVVTHHVTAMVVQSEAARYLTAAPE 
RLDESLAAVSDTGRRAITDLRHLLDLLNPDHGTAEPRTPPVGRVLTLVEQTRRAGQPV 
EFTEEGTPAAATGSSDLVAYRVVQEALTNALKYDHGGRTSVLVRHGEREITVEVGTDG 
SGSGAASPGGSGRGLAGLRERVDVLGGEFSTDRPADGGFVVRARIPGGSGGSTA" 
gene 2814. .3473 

/gene="SClA9. 04" 
CDS 2814. .3473 

/gene="SClA9.04" 

/note="SClA9. 04, probable luxR family response regulator, 
len: 219 aa; similar to a family of regulators egs. 
TR:069816 ( EMBL : AL0234 96 ) probable two-component regulator 
from Streptomyces coelicolor (224 aa) fasta scores; 
opt:714, z-score: 1103.5, E():0, (54.8% identity in 219 aa 
overlap) and NARL_ECOLI (EMBL : X13360 ) nitrate /nitrite 
response regulator from Escherichia coli {216 aa) fasta 
scores; opt: 465, z-score: 656.5, E(): 2.7e-29, (37.6% 
identity in 213 aa overlap) . Contains Pfam match to entry 
PF00072 response_reg, Response regulator receiver domain, 
score 111.10, E-value 2.1e-29, Pfam match to entry PF00196 
GerE, Bacterial regulatory proteins, luxR family, score 
79.50, E-value 7e-20, PS00622 Bacterial regulatory 
proteins, luxR family signature and an helix-turn-helix 
motif from: 1 to: 219, Score 983 (+2.53 SD) . " 
/codon_start=l 
/transl_table=l_l 

/product="putative response regulator" 
/protein id=" CAA22374 . 1 " 
/db_xref="GI : 4007689" 

/translation="MSAPIRVVICDDQALIRTGLATIVDAQPDLEVVGECGDGQTGVD 
LARELRPDVVVMDIRMPVLDGLEATRLLAGAGVAHPVKVLVVTTFNLDEYVYEALRAG 
ASGFLLKDAPPDRLLHGIRTVAMGAALLDPDVTRRLVGRYAARIRPAEGTARDI PLTP 
RETEVLRLIADGLSNSEIAAALVISPETVKTFVSRILTKLDLRDRVQAVVFAYRHGLV 

misc feature 2826.. 3173 

/gene="SClA9. 04" 

/note="Pfam match to entry PF00072 response_reg, Response 
regulator receiver domain, score 111.10, E-value 2.1e-29" 
misc feature 3276. .3470 

/gene="SClA9. 04" 

/note="Pfam match to entry PF00196 GerE, Bacterial 
regulatory proteins, luxR family, score 79.50, E-value 
7e-20" 

misc feature 3327.. 3410 

/gene="SClA9. 04" 

/note="PS00622 Bacterial regulatory proteins, luxR family 
signature . " 
gene complement (34 92. .4 094) 
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/gene="SClA9.05c" 
CDS complement (34 92 . .4094 ) 

/gene="SClA9.05c" 

/note="SClA9.05c, possible transmembrane protein, len: 200 

aa; contains possible membrane spanning hydrophobic 

regions" 

/codon_start=l 

/transl_table=U 

/product="putative transmembrane protein" 
/protein id=" CAA22375 . 1 " 
/db_xref="GI : 4007690" 

/translation="MFRANAGIRSAPYVHVFPMRGDiyiASTERSDDRPRLRRRDSLWGI 
GVMALLAGVVVRLVLNGTSAWLSALLGAVPAAVWI VWWVRRRRVRDARAVGAEPDDVP 
AMERQI LKGGPAPRDPERRRAMAAFVDSRQERLRRNRWWAFPMLAVI FFGTSALWYLS 
GS VGAGSLMLGLG VVFLGWLAWYNLRI DRRLSHMRGRLRG " 
gene 4140. . 4631 

/gene="SClA9. 06" 
CDS 4140. .4631 

/gene="SClA9. 06" 

/note="SClA9. 06, unknown, len: 163 aa" 

/codon_start=l 

/transl_table=ll 

/product="hypothetical protein SC1A9.06" 
/protein id=" CAA2237 6 . 1 " 
/db_xref="GI : 4007691" 

/translation-"MRDMNDTRTPTSTVTSTRTPADAVTGMVDHVLHLAAGWTRWDGT 

PAHVDGRVYTPHKAIRRVADHLVDHLAELEARLAGEETQPDHWHASLVTTEADRAAFT 

AEDLDEARSRLTRLARIWANRLDALTDEQLDHSPGEGWSFRELAAHLAESGYYADAVG 
DLS" 

gene 4628 . . 5383 

/gene="SClA9.07" 
CDS 4628. .5383 

/gene="SClA9. 07" 

/note="SClA9.07, unknown, len: 251 aa; similar to 
TR:033986 { EMBL : U82823 ) hypothetical protein from 
Saccharopolyspora erythraea {266 aa), fasta scores; opt: 
740, z-score: 869.0, E(): 0, (53.4% identity in 251 aa 
overlap) . " 
/codon__start = l 
/transl_table=n 

/product=="hypothetical protein SC1A9.07" 
/protein id=" CAA22377 . 1 " 
/db_xref="GI : 4007692" 

/translat ion="MTATAFAALHRAGEPLLLPCAWDHASAFALAGQGFRAVGTTSLG 
VAAAAGLPDGASATRDETLRLALVLGSAPFPLSVDAEDGFSDDPDEVGEFARQLAAVG 
AVGINLEDGLGPVGRHAAKIAAVRSAAPGLFVNARTDTYWSGDGDVTETLRRLEAYRE 
AGADGVFVPGLTDPARIGSLAARFDVPLNVLYTPAGPGLAHLADLGVRRVSLGSLLYR 
RALGAALRAAADVRAGRDPGGPTPTYDEVRAPG" 

gene complement (5386. . 5955) 

/gene="SClA9. 08c" 

CDS complement (5386. .5955) 

/gene="SClA9.08c" 

/note="SClA9.08c, possible transcriptional regulator, len: 
189 aa; similar to TR:034892 (EMBL: AF027868 ) proposed 
transcriptional regulator from Bacillus subtilis (191 aa) 
fasta scores; opt: 302, z-score: 488.2, E(): 6.6e-20, 
(31.8% identity in 173 aa overlap). Contains an 
helix-turn-helix motif from: 1 to: 189, Score 1074 (+2 84 
SD) . " 

/codon_start=l 
/transl table=ll 

/product="putative transcriptional regulator" 
/protein id=" CAA22378 . 1 " 
/db_xref="GI : 4007693" 

/ translation="MPRVGLTTDRVVAAAADLADETGFESVTVSALARHFGVKDASLY 
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THVRNLQDLRVRVALLAGGELIEEIAGAVAGRAGKEALAAFAGAYRAYALRHPGRYAA 

TQIRVDQSLVADSAALRRTAEITYGMLRSYGLTEPDLTDAVRLLRSTFHGYCALESSG 

AFGAPRDVRASWDKAVDALHVALENWPRA" 
qene 6107.. 7825 

/gene="SClA9. 09" 
CDS 6107. .7825 

/gene= M SClA9. 09" 

/note="SClA9. 09, unknown, len: 572 aa; Contains His rich 
extreme N-terminal . " 
/codon_start=l 
/transl tableau 

/product="hypothetical protein SC1A9.09" 
/protein id=" CAA22379 . 1 " 
/db_xref="GI : 4007694 " 

/translation="MGHGHGHPHGHHHHGHTHDHEHATEQVLPAAFDTSVPDEALSPA 
QQSRRGLLRRAGLLGAGLAAGTVLAPAATATAAPARAASNGRRGKGFLWLAGDHHIHT 
QYSSDGKYRVVDQVRQGARHGMDWLVITDHGSNTHAKIGVEKVNPDIREARAAHEDTL 
VFQGLEWNIPAAEHGTVFVHPGKHEVSVLKQFETDYDGSVKGAGDSTPANEALAIAGL 
SFLADQVKRRKVKDALMLANHPARKGIDSPHEIRAWRDATSRGHQIAVGFEGAPGHQA 
GGLPEPLGPGGARGIYDGSPSANSFAGYPLESYRTWGGFDWMTATVGGLWDSLIAEGR 
PWWITANSDSHQVYADTGARGGGDFNANGRYDDPVYAGQIDITQNDFWPGQYSRTHVG 
SDGFSYAAVMDGIRAGRVWVDHGQLVSGLDVRVSGGGRWATLGGALHVRRGTRVTLSI 
DVALAGGPNWAGFVPKLARVDVIQGDVTGPAADKDTFTAPTARVVKSYEVDKETGTVR 
LTYDLGRVDRPVYLRTRGTDGNRSAVGSLGAKVDPAGPAI DVVGDADPWRDLWFYSNP 
VWVLPS" 

gene 7822 . . 8346 

/gene="SClA9. 10" 

CDS 7822.. 8346 

/gene= n SClA9. 10" 

/note="SClA9. 10, unknown, len: 174 aa" 

/codon_start=l 

/transl table=ll 

/product="hypothetical protein SC1A9.10" 
/protein id=" CAA22380 . 1 " 
/db_xref="GI : 4007695" 

/ translat ion="MTPPAAPYVLGIDTDATTLREADHLLQALAAELDLPEGVFGCTH 
LVRDGRPRVALSLAAEAEPVLRTARDRLTARGHEVRDGTWDESGRAVLFPGAAALTGT 
LTLAELLARSAVDRVTVLGTPDEPSPDTRLVTRNHVRPHWQDGRLVLAAMPAVGGTLV 
PFEDPDPTPCCADH" 

gene complement (8377 . . 9633) 

/gene="SClA9. 11c" 

CDS complement (8377 . . 9633) 

/gene="SClA9. 11c" 

/note="SClA9 . 11c, oxidoreductase, len: 418 aa, similar to 
many eg. TR:P97011 { EMBL : AB0005 1 9 ) proposed sorbitol 
oxidase from Streptomyces sp. (420 aa) fasta scores; opt: 
1544, z-score: 1630.2, E():0, (60.0% identity in 413 aa 
overlap) and TR:O50531 (EMBL : AL009204 ) FAD-dependent 
oxidoreductase from Streptomyces coelicolor (445 aa) fasta 
scores; opt: 374, z-score: 499.4, E(): 1.6e-20, (31.4% 
identity in 439 aa overlap)." 
/codon_start=l 
/transl_table=ll 

/product="putative oxidoreductase" 
/protein id=" CAA22381 . 1 " 
/db_xref="GI : 4007696" 

/translation="MSDITVTNWAGNITYTAKELLRPHSLDALRALVADSARVRVLGS 
GHSFNEIAEPGDGGVLLSLAGLPSVVDVDTAARTVRVGGGVRYAELARVVHARGLALP 
NMASLPH I SVAGSVATGTHGSGVGNGSLAS VVREVELVTADGSTVVI ARGDERFGGAV 
TSLGALGVVTSLTLDLEPAYEMEQHVFTELPLAGLDPATFETVMAAAYSVSLFTDWRA 
PGFRQVWLKRRTDRPLDGFPYAAPAAEKMHPVPGMPAVNCTEQFGVPGPWHERLPHFR 
AEFT PSSGAELQSE YLMPREHALAALH AMDAI RETLAPVLQTCE I RT VAADAQWLS PA 
YGRDTVAAHFTWVEDTAAVLPVVRRLEEALVPFAARPHWGKVFTVPAGELRALYPRLA 
D FG AL AG AL D PAG K FT N A F V RG VLAG " 
gene 9740. .11068 
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/gene="SClA9. 12" 
CDS 9740. . 11068 

/gene="SClA9. 12" 

/note="SClA9. 12, unknown, len: 442 aa" 

/codon_start=l 

/transl table=ll 

/product="hypothetical protein SC1A9.12" 
/protein id=" CAA22382 . 1 " 
/db_xref-"GI : 4007697" 

/trans la tion="MDSRTALVEDLMERFPHVPREAVFKEDLLRGGVAFDPSALSDTT 
NEAAGEVKPKSYFIFSFDHGTLPELGEAALRRPPEEIILTGGPYDLRRTVVSVRVNPA 
SPYRVAANEDGVLGLYLDGKRIADVGVPPMPEYYRHKLSNGKSVMEVAPTIQWGYLIY 
LTAFRVCQYFGAKEECQYCDINHNWRQHKAAGRPYTGVKDVDEVLEALEIIDKYDTAK 
ISTAYTLTGGAITSKVQGLDEADFYGRYAKAIEEHFPGRWIGKVVAQALPKPDVQRFK 
DYGVQI YHPNFEVWDEYLFKMYCPGKERYVGRDEWHKRILDSTEVFGARNVI PNFVAG 
VEMAEPFGFKTVDEAIESTTEGLRFFMSHGITPRFTTWCPEPTTPLGKTNPDGAPLEY 
HIRLLQAYRQTMEDYGLSSPPGYGPPGAGNAVFSVSSFMDSLPEDAPVEV" 
gene 11316. . 12416 

/gene="SClA9. 13" 
CDS 11316.. 12416 

/gene="SClA9. 13" 

/note="SClA9. 13, possible nucleotide binding protein, len: 
366 aa; similar to several hypotheticals eg. YJEQ_HAEIN 
(EMBL:U32844) hypothetical protein HI1714 from Haemophilus 
influenzae (346 aa) fasta scores; opt:472, z-score: 564.4, 
E(): 3.7e-24, {31.0% identity in 313 aa overlap). Contains 
PS00017 ATP /GTP-binding site motif A (P-loop)." 
/codon_start=l 
/transl table=ll 

/product="putative integral membrane nucleotide binding 
protein" 

/protein id=" CAA223S3 . 1 " 
/db_xref="GI : 4007698" 

/translat ion="MTSTSSHSNHSALSSYGWDDSWADAFAPYAAEGLLPGRVVRVDR 
GQCDVVTADGVLRADTAFVTPHDPLRVVCTGDWVAVEPGGNPRYVRTYLPRRTAFVRS 
TSSKRSEGQILAANVDHAVVAVSLAVELDLARIERFLALAWESGAQPLVVLTKADLVP 
DPVTLAYLVQDVETAAPGVPVLPVSAEQGEGLDVLAAVVSGGTAVLLGQSGAGKSTLA 
NALLGEAAMDVQAIRDVDGKGRHTTTTRNLLALPGGGVLI DTPGLRGVGLFDAGNGVD 
QVFAEIAELAEECRFHDCAHESEPGCAVLAAIDSGALPERRLESYRKLMRENQRI VAK 
T DARARAE I RKE YKRRGAI GKAAMEAKRGGLR " 
misc feature 11937. .11960 

/gene="SClA9. 13" 

/note="PS00017 ATP /GTP-binding site motif A (P-loop)" 
gene 12489. . 13922 

/gene="SClA9. 14" 
CDS 12489. . 13922 

/gene="SClA9. 14" 

/note="SClA9. 14, possible ADA-like regulatory protein, 
len: 477 aa; similar to ADA_MYCTU (EMBL : Z7 3902 ) putative 
ADA regulatory protein from Mycobacterium tuberculosis 
(496 aa) fasta scores; opt: 1135, z-score: 1376.3, E{): 0, 
(51.6% identity in 486 aa overlap). Also note N-terrninal 
region similar to N-terminal region of ADA_ECOLI 
(EMBL:M10211) ADA regulatory protein from Escherichia coli 
(354 aa) fasta scores; opt: 302, z-score: 255.8, E(): 
5.7e-07, (34.5% identity in 171 aa overlap) and C-cerrninal 
region similar to 3MG2_ECOLI AlkA, 3-methyladenine DNA 
glycosylase II involved in DNA repair (282 aa) fasta 
scores; opt: 171, z-score: 232.9, E(): l.le-05, (30.0% 
identity in 290 aa overlap) . Contains Pfam match to entry 
PF00165 HTH_2, Bacterial regulatory helix-turn-helix 
proteins, araC family, score 72.90, E-value 6.7e-18 and 
PS00041 Bacterial regulatory proteins, araC family 
signature . " 
/codon start=l 
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/transl tableau 

/product="putative ADA-like regulatory protein" 
/protein id=" CAA22384 . 1 " 
/db_xref="GI : 4007699" 

/translation="MTPQTVQPAEHADAREDVRYEAVRSRDARFDGAFFFAVETTGIY 
CRPSCPAVTPKRRNVRFFATAAAAQGSGFRACRRCRPDAVPGSADWNVRADVVGRAMR 
LIGDGVVDREGVAGLAGRLGYSARQVQRQLTAEVGAGPVALARAQRAHTARVLLQTTV 
LPVTEIAFASGFASVRQFNDTIRAVYAATPSELRAAAPARDRAARRTATPSAGVPLRL 
AHRGPYQAGPVFDLLQREAVTGVEEVSGETGRRLYRRTLRLPYGTGIVAVQERPGRAG 
TGSGGWLEARLHLTDLRDLTTSVQRLRRLFDLDADPYAVDERLGADPRLAPLVAARPG 
LRSPGTADPAELAVRALVGRTEAERLVQRYGKALDAPCGTLTHLFPEPDVLAGAAPHG 
TPGALAAALADGAVRLDPGADRDDAERALLAVPGL DART VAVVRTRALGDPDVAP PGA 
AVPDTWRPWRSYALNHLRAAGEWENDR" 
misc feature 12819.. 13079 

/gene="SClA9. 14" 

/note="Pfam match to entry PF00165 HTH_2, Bacterial 
regulatory helix-turn-helix proteins, araC family, score 
72.90, E-value 6.7e-18" 
misc feature 12930.. 13058 

/gene="SClA9. 14" 

/note="PS00041 Bacterial regulatory proteins, araC family- 
signature . " 

gene 13919. .14479 

/gene="SClA9. 15" 

CDS 13919. .14479 

/gene="SClA9. 15" 

/note="SClA9. 15, ogt, methylated-DNA-protein-cysteine 
methyltransferase len: 186 aa; similar to many eg. 
DAT1_BACSU (EMBL:X15659) methylated-DNA-protein-cysteine 
methyltransferase from Bacillus subtilis {165 aa) fasta 
scores; opt:357, z-score: 478.2, E(): 2.3e-19, {41.4% 
identity in 157 aa overlap) . Contains Pfam match to entry 
PF01035 Methyltrans, 6-O-methylguanine DNA 
methyltransferase, score 170.70, E-value 2.5e-47." 
/codon_start=l 
/transl_table=l_l 

/product="putative methylated-DNA-protein-cysteine 
methyltransferase" 
/protein_id= " CAA22385. 1 " 
/db_xref="GI : 4007700" 

/translation="MTTTTPTTTTTSIPAETYWHEVDSPVGPLLLTAGSDGALTSLSV 
PGQKGGRSVRDGWRHDAGPFRVAEEQLGAYFAGELTEFSLPLRAQGTAFRERVWAALD 
DVPYGATTTYGEIAARIGASRPAVRAVGGAIGANPLLILRPCHRVIGADGSLTGYAGG 
LERKTRLLSLEGAPLSRPVPLPATPR" 
misc feature 14111. .14446 

/gene="SClA9. 15" 

/note="Pfam match to entry PF01035 Methyltrans, 

6-0-methylguanine DNA methyltransferase, score 170.70, 

E-value 2.5e-47" 
gene complement (144 42 . . 14 924 ) 

/gene="SClA9. 16c" 
CDS complement (14 4 42. . 14924 ) 

/gene="SClA9. 16c" 

/note="SClA9. 16c, small hydrophobic protein, len: 160 aa" 

/codon_start=l 

/transl_table=ll 

/product="small hydrophobic protein" 
/protein id^" CAA22386 . 1 " 
/db_xref="GI : 4007701" 

/ trans la tion="MGAWDLLLAGLVILLGLCGVLLPGVPGSWLVWAGVLWWALKDPR 
PLAWAVLVGSTVVLLLSRAVRWALPTRRKRRDEAMRRLTAYAGAGAVLGFVLVPVLGA 
VPGFMGGI YLAERLRLGRHGEAMASLRTAMRQGGADLLTELFACLLITGAWLGAVLAG 

gene complement (14986. . 15969) 

/gene="SClA9. 17c" 
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CDS complement (14 986. .15969) 

/gene="SClA9. 17c" 

/note="SClA9. 17c, possible regulatory protein, len: 424 
aa; similar to TR:Q56084 ( EMBL : D63904 ) ESA36 protein of 
undefined function from Streptomyces thermoviolaceus (334 
aa) fasta scores; opt: 313, z-score: 401.9, E(): 4.1e-15, 
(30.0% identity in 337 aa overlap). Also similar to 
SW: BRPA_STRHY bialaphos biosynthetic pathway regulatory 
protein from Streptomyces hygroscopicus (256 aa), fasta 
scores; opt: 234, z-score: 233.8, E(): 9.5e-06, (30.11 
identity in 163 aa overlap) . Contains helix-turn-helix 
motif from: 1 to: 424, Score 1064 (+2.81 SD) . Possible 
coiled-coil from 171 to 200 (30 residues) Max score: 1.447 
(probability 0.87)" 
/codon_start=l 
/transl_table=ll 

/product="putative regulatory protein" 
/protein id=" CAA22387 . 1 " 
/db_xref="GI : 4007702" 

/trans la tion="MLGVLGLEDTHEAAYRALVSVGAADVPDLARRLALGERDTERAL 

RRLEQNGLAAQSSARPGRWVAAPPGVALGALLTQQRHELERAELAAALLAEEYRAAAA 

EPAVHDLVEVVTGAGAVAQRFLQLQLGASEEVCALVTDKPVAVTGMENDAEEQATGRG 

VRYRVVVERSVLDLPTGITELTAALGRDEQVRVVDRVPTKLIVADRSLALVPLTARSS 

EPAALVVHASGLLELLCGLFEAVWRDALPLRLGASGVTEQAPDGPDGTDLEILSLLLA 

GLTDASVAKQLDLGLRTVQRRVKRLMELTGVTTRLQLGWHAYERDWVARRD" 
gene 16892 . . 18064 

/gene="SClA9. 18" 

CDS 16892. .18064 

/gene="SClA9. 18" 

/note="SClA9. 18, unknown, len: 390aa" 

/codon_start=l 

/transl_table=l 1 

/product="hypothetical protein SC1A9.18" 
/protein id=" CAA22388 . 1 " 
/db_xref="GI : 4007703" 

/ translat ion="MSSPASGAGRPAAPRRPDPRGPTTAGSGAADLREPPEAVPPPRT 
LPDLPPLPPRFLPSTEADRDYVGSGPPTYDAEPTALPPADPDGLDGLVPDTVLEGARY 
GTCTLRAVSVRGDSARYRGEPRRDALLVARFGAGEQALVLVAMATGARATAGAHRAAA 
EVCRWIGRAVGRSHARLAEDLRAARRGDLKSGLHRLTDRSLGRLRAGAAEQGLAPDEY 
AATLRCLLLPADPGCRTRVFFGVGAGGLLRLRDGAWQDMEPDAGDVTGEPVLGFGSAP 
RDAPRDAPRETPEDDRLTMDLGITTSPGPYEGPPAGPPREPFRFRASVARPGDVLLMC 
TAGLAEPLLSEPGLGELLARRWAARPAPGPGEFLADSGVRVKGYADDRTAAAVWEA" 
gene 18093.. 19835 

/gene="SClA9. 19" 
CDS 18093.. 19835 

/gene="SClA9. 19" 

/note="SClA9. 19, poxB, pyruvate oxidase, len: 580aa; 
similar to many including POXB_ECOLI (EMBL: X04 105) PoxB, 
pyruvate oxidase from Escherichia coli (572 aa) fasta 
scores; opt:2086, z-score: 2458.4, E():0, (53.1% identity 
in 571 aa overlap) . Contains Pfam match to entry PF00205 
TPP_enzymes, Thiamine pyrophosphate enzymes, score 499.70, 
E-value 2 . 4e-154 . " 
/codon_start=l 
/transl table=ll 

/product = "pyruvate dehydrogenase" 
/protein id=" CAA22389 . 1 " 
/db_xref ="GI : 4007704 " 

/translat ion="MAKQNVAEQFVDILTRAGVERLYGVVGDSLNPVVDAVRRHSGIE 
WVHVRHEETAAFAAGAEAQITGKLTACAGSCGPGNLHLINGLYDAHRSMAPVLALASQ 
IPSSEIGLGFFQETHPDQLFRECSHYSELISSPKQMPRLLQTAIQHAVGQGGVSVVSL 
PGDIADEPAPQGAAETALVTSRPTVRPGDEEIDRLVRMIDDADKVTLFCGSGTAGAHA 
EVMEFAGKLKAPVGHALRGKEFIQYDNPYDVGMSGLLGYGAAYEATHECDLLLLIGTD 
FPYNAFLPDDVKIAQI DVRPEHLGRRSKLDLAVWGDARETLRCLI PRVKEKKNRRFLD 
RMLKKHADALEGVVKAYTRKVDKHVPIHPEYVAALLDEMADDDAVFTVDTGMCNVWAA 
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RYISPNGRRRIIGSFSHGSMANALP^4AIGAQFTDRRRQVVSMSGDGGFTMLMGDFLTL 
VQHDLPVKIVLFNNSSLGMVELEMLVAGLPSHGVANKNPDFAAVAEACGAFGVRVEKP 
KDLAGALKAAFKHKG PALVDVVTDPNALSI PPKI SADMVTGFALSASKI VLDGGVGRM 
LQMARSNLRNVPRP" 
misc feature 18132.. 19679 

/gene="SClA9. 19" 

/note="Pfam match to entry PF00205 TPP_enzymes, Thiamine 

pyrophosphate enzymes, score 499.70, E-value 2.4e-154" 
gene 19979.. 20386 

/gene="SClA9.20" 
CDS 19979.. 20386 

/gene="SClA9.20" 

/note="SClA9. 20, possible regulatory protein, len: 135aa; 
similar to TR:O69204 ( EMBL : U33059 ) hypothetical protein 
from Amycolatopsis mediterranei S699 {144 aa) fasta 
scores; opt:225, z-score : 335 . 1 . E(): 2.2e-ll, (41.7% 
identity in 115 aa overlap) . Also similar to TR:Q53897 
(EMBL:X60316) AbaA regulatory locus for antibiotic 
production in Streptomyces coelicolor (192 aa) , fasta 
scores; opt: 155, z-score: 284.6, E(): 1.4e-08, (33.3% 
identity in 120 aa overlap)" 
/codon_start=l 
/transl tableau 

/product="putative regulatory protein" 
/protein id=" CAA22390 . 1 " 
/db xref="GI: 4007705" 

/transl at ion="MPQLRRRLGRADLRAVPEARRALRELLRHWGGPGQSEVAELLAS 

ELVTNALVHTDEGAVLTATVGPRALRVEVRDFVGRGRQPRPRAPQREESTNGRGLVLV 

ESLADDWGVQPCEVGKSVWFELGAGAEAEAEAA" 
gene complement (20444 . .21508) 

/gene="SClA9.21c" 
CDS complement (20444 . .21508) 

/gene="SClA9.21c" 

/note="SClA9 . 21c, possible transmembrane protein, len: 

354aa; contains possible membrane spanning hydrophobic 

domains in the N-terminal region and possible coiled-coil 

from 221 to 250 (30 residues) Max score: 1.449 

(probability 0.87) . " 

/codon_start=l 

/transl_tabie=ll 

/product="putative transmembrane protein" 
/protein _id=" CAA22391 . 1 " 
/db xref="GI: 4007706" 

/trans lation="MRLTDISLNWLLPGAVLLLGMLAAVAVLARGKRSSGKDAGADDS 
WERMEERRRRKEALYGTFSYVLLFCCAAVAAALSFHGLVGFGEQNLGLSDGWQYLVPF 
GLDGAAMFCSVLAVREASHGDAALGSRILVWAFAFAAAWFNWVHAPRGLGHAGAPHFF 
AGMSLSAAVLFDRALKQTRRAALREQGLVPRPLPQIRMVRWLRAPRETYRAWSLMLLE 
GVRSLDEAVEEVRDDRRQKEEKKLRRREQERLERAQLKAISRGHGHRGFPGRGGRQVE 
VEVQQVERGSERAAAE PAISTPEQLPAASRRPSLQPVRSGSEQMSVGTVDLTAEDDTQ 
ALPRLDSLERKLKDLEQQFG" 

gene complement (217 66. .22611) 

/gene="SClA9.22c" 

CDS complement (21766. .22611) 

/gene="SClA9.22c" 

/note="SClA9. 22c, unknown, len: 281aa M 

/codon_start=l 

/transl table=ll 

/product="hypothetical protein SClA9.22c M 
/protein id=" CAA22392 . 1 " 
/db xref="GI : 4007707" 

/translation="MPVARTALTDAYTRLSEVLPGLGVTELAAADEVPSGGGWVTAAS 
LAAGGTELAAFLAWDEAQVLRDYGQRARPDVIASFGLHRYAWPACLLITVPWFLHRRV 
PRHPAAHVSYDRTAAGLPLGRMAVRAASFACLPGDPAAALPGARVVADEEALRAEVRA 
AVAEHLEPVLAGFGPRMRRRGRALWGMATDEVVEGLWYVAHLLGEQERARHELELLLP 
GATRPYVGKAAFRELKGPDGAPLHTRDRASCCMFYTLRPEDTCATCPRTCDADRVDKL 
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LAAAG" 

gene 22780. .23601 

/gene="SClA9.23" 
CDS 22780.. 23601 

/gene="SClA9.23" 

/note="SClA9. 23, possible gntR family transcriptional 
regulator, len: 273aa; similar to TR:069996 
(EMBL:AL022374) WhiH, sporulation transcription factor 
from Streptomyces coelicolor (295 aa) fasta scores ; opt: 
139, z-score: 267.9, E(): 1.2e-07, (29.8% identity in 292 
aa overlap) . Contains Pfam match to entry PF00392 gntR, 
Bacterial regulatory proteins, gntR family, score 30.50, 
E-value 7 . le-08 . " 
/codon_start=l 
/transl _table=ll 

/product="putative gntR family transcriptional regulator" 
/protein id=" CAA22393 . 1 " 
/db xref="GI: 4007708" 

/ trans la tion="MKHSAQGAARTGVGTGTGAAAEAVRIPVQPGAADRARGRAPGGA 
EPDG PARGEHTHGEPPVPRPRALVQRSSVRGQILDALRSALVTGELRPGEVYSAPVLG 
ERFGVSAT PVREAMQQLALEGAVEVVPNRGFRVLERGDRELAELAEVRALIEVPVWLR 
LARTVPAEHWAELRPLAEGTVRAASSGCPATYAEADRAFHRAALALAGNEQLVRI AGD 
VHRRAQWPPAGSPSVRGRADLVADAHQHTALLDALIAGDLDVVRALVGDHFTVTS " 

misc feature 23008.. 23184 

/gene="SClA9 . 23" 

/note="Pfam match to entry PF00392 gntR, Bacterial 
regulatory proteins, gntR family, score 30.50, E-value 
7 . le-08" 

qene complement (23847 . .26234) 

/gene="SClA9.24c" 
CDS complement (23847 . .26234) 

/gene="SClA9.24c" 

/note-"SClA9.24c, SecDF, protein-export membrane protein, 

len: 795aa; similar to many eg. TR:G3220156 

(EMBL:AF024506) SecDF protein from Bacillus subtilis (737 

aa) fasta scores; opt:802, z-score : 1099 . 4 , E():0, (30.7% 

identity in 740 aa overlap)." 

/codon_start=l 

/transl_table=ll 

/product="putative SecDF protein-export membrane protein" 
/protein_id=" CAA22394 .1 " 
/db xref="GI : 4007709" 

/trans la t ion- "MKRSRPPSRSRSRTRSRSLNVRALVALAVMAGAVAIALTMPVRL 
GLDLRGGTQIVLETKSTETTKADREATDRTVEVLRGRIDALGVAEPTI VRSGENRVVV 
ELPGVQDPKKAADVLGRTAQLTVHSVLGAAEKPADATEGRTETAEDGERVLPDESGQS 
LRLKAATLTGQDVKGADARFDQQNGAGWTVTVDFKDSGS DRWAQVTGEAACN PAG DPT 
RRVAIVLDDKIISSPQVDPSVSCGAGITGGSTQITGSFDDAEARELALLIKGGALPVP 
VETIEQRTIGATLGDEAIDAGAWAAVIGTALTALFI I VVYRLMGALATVALLCYGLIS 
YAALAAVGATLTLPGLAG FVLAIGMAVDANVLVFERAREEQAARTRPSTRSALTAGFR 
SAFSAIADSNITTLIAAALLFFLASGPVKGFGVTLGIGVLASMVSALVITRVLAEFAA 
SRPAVFRRPRITGISSTGPVRDALLRRDPFLMRRPRRWLAASLIVLVVAGSGILVRGL 
NFGIEFTGGRLIEYSTATQVDPDRARDALADAGFPRAVVQSSGDGDLTVRTEELTDTE 
AATVTKAVAELGGETEKVRDELIGPSLGEELRRDALIALGLALAAQLAYLAVRFRLLF 
GTAAVGALAHDVVILVGVFAWLGKPIDGVFLAALLTVIGYSVNDSVVLFDRIRELLGK 
ERKAPFDRLTNDAILQTLPRTVNTGMGAVLILASLAILADDSLTDFALALLIGVGVGT 
YSSVFTASPLAIELHNRDTGSRPGRRRGGRTRATGKSEKPGKQVPTSRTERQEVR" 

gene complement (26314 . . 26703) 

/gene="SClA9.25c" 
CDS complement (26314 . .26703) 

/ g ene-"SClA9.25c n 

/note="SClA9.25c, probable secreted protein, len: 129aa; 
Contains a strong signal sequence." 
/codon_start=l 
/trans l_table= 11 

/product="hypothetical protein SClA9.25c" 
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/protein_id=" CAA22395 . 1 " 
/db xref="GI: 4007710" 

/translation="MRRRLASVLTVLLTVLLPLVPAW PAAGAHVGPSGPLAAAAATAV 
PHPALDLHADDGCTPVCAAQPRARHDQPAGRPTAPDQHPATTAHLGGGAAPCGHARTS 

FAPGPVPVSPGRASHDSGRAPPVSSGT" 
qene complement (26808. .27590) 

/ g ene=• l SClA9.26c ,, 
CDS complement (26808. .27590) 

/gene="SClA9. 26c" 

/note-"SClA9. 26c, probable luxR family response regulator, 
len: 260aa; similar to a family of response regulators eg. 
TR:069961 ( EMBL : AL0222 68 ) probable two-component 
transcriptional regulator from Streptomyces coelicolor 
(219 aa) fasta scores; opt:1079, z-score: 1434.4, E():0, 
(77.5% identity in 218 aa overlap) and DEGU_BACBR 
(EMBL:L15444) transcriptional activator protein DegU from 
Bacillus brevis (236 aa) fasta scores; opt: 378, z-score: 
682.5, E(): 9.7e-31, (36.6% identity in 227 aa overlap). 
Contains PS00622 Bacterial regulatory proteins, luxR 
family signature, Pfam match to entry PF00196 GerE, 
Bacterial regulatory proteins, luxR family, score 99.70, 
E-value 5.6e-26, Pfam match to entry PF00072 response_reg, 
Response regulator receiver domain, score 132.90, E-value 
5.6e-36 and an helix-turn-helix motif from: 1 to: 260, 
Score 1221 (+3.35 SD) ." 
/codon_start=l 
/transl_table=ll 

/product="putative response regulator" 
/protein id=" CAA22396 . 1 " 
/db xref="GI: 4007711" 

/translation="MSDPSLPEPSEPSARGRHAGASTTPTTPTSASTTSAYGTPPASA 
PSKIRILLADDHALVRRGVRLILDREPDLEVVAEAGDGAEAIDMARAHEADLAVLDIA 
MPRLTGLQAARELAALKPGLRILMLTMHDNEQYLFQALKSGACGYVLKSVADRDLVAA 
CRAAMRDEPFLYPGAVTALIRNYLDRVRHGEETSDHILTPREEEVLKLVAEGHSSKEI 
AE I LFI S I KTVQRHRANLLQKLGLRDRLELTRYAI RAGL I E P " 
misc feature complement (2 687 4 . . 2 6957) 
/gene="SClA9.26c" 

/note="PS00622 Bacterial regulatory proteins, luxR family 
signature . " 
misc feature complement ( 2 6952 . .27293) 
/gene="SClA9.26c" 

/note="Pfam match to entry PF00072 response_reg , Response 
regulator receiver domain, score 132.90, E-value 5.6e-36" 
misc feature complement ( 27 393 . .27590) 
/gene="SClA9.26c" 

/note="Pfam match to entry PF00196 GerE, Bacterial 
regulatory proteins, luxR family, score 99.70, E-value 
5.6e-26" 

gene complement ( 27599 . .28510) 

/gene="SClA9.27c" 
CDS complement (27599. .28510) 

/gene="SClA9.27c" 

/note="SClA9 . 27c, probable sensor kinase, len: 303aa; 
similar to a family of sensor kinases eg. TR:O69960 
(EMBL: AL022268) probable two-component sensor protein from 
Streptomyces coelicolor (358 aa) fasta scores; opt: 1037, 
z-score: 1477.8, E():0, (59.2% identity in 299 aa overlap) 
and TR:Q52558 (EMBL : U0204 1 ) a two-component sensor 
regulating virulence genes of Pseudomonas solanacearum 
(502 aa) fasta scores; opt: 388, z-score: 475.5, 
E():3.3e-19, (31.5% identity in 270 aa overlap). Contains 
hydrophobic N-terminus . " 
/codon_start=l 
/transl_table=ll 

/product="putative sensor kinase" 
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gene 
CDS 



gene 
CDS 



BASE COUNT 
ORIGIN 

1 
61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 



/protein id=" CAA22397 . 1 " 
/db xref="GI: 4007712" 

/ trans la tion="MVLGFATALLLWAPVTVSVPILLTEAVVLVVGMGVMLVANG ALL 

RWGLAPLERLTRLMTTVDLLRPGQRLPISGGGEVPELIRTFNAMLDRLENERATSSAR 

VLLAQEAERRRIAQELHDEVGQSMTAILLVLGRAADDAEEPLRDELHQAQEITRESLD 

EVRRLVRRLRPGVLDDLGLISALSSLTHDFATHTGLRVVRRFDADLPVLDHETELVLY 

RVAQESLTNAARHADAERLEVGLAHADAAVTLTVADDGRGIEAAHEGAGIRGMRERAL 

LIGAALDITSAPGAGTRIRLTAPLPRK" 

28890. .29216 

/gene="SClA9.28c" 

28890. .29216 

/gene= n SClA9.28c" 

/note="SClA9.28c, unknown, len: 192aa; some similarity to 
TR:066611 (EMBL: AE000680) putative protein from Aquifex 
aeolicus (145 aa) fasta scores; opt: 188, z-score: 294.0, 
E():4.4e-09, (36.6% identity in 93 aa overlap)" 
/codon_start=l 
/transl_table-ll 

/product="hypothetical protein SClA9.28c" 
/protein id=" CAA22398 . 1 " 
/db xref="GI: 4007713" 

/trans la tion="MSLDASRIEPRPERLTAHEARQRLEHARNTRVTQLQALAESGQA 
DDQLMSAQKAAIERVLKEIDEAFARVEEGTYGACLGCGKPVPGERLEILPYTRYCVAC 

QRRAAA" 
29286. .29400 
/gene="SClA9.29c" 
29286. .>29400 
/gene="SClA9.29c" 

/note="SClA9. 29c, partial CDS, unknown, len: 38aa" 

/codon_start=l 

/transl tableau 

/product="hypothetical protein SClA9.29c" 
/protein id=" CAA22399 . 1 " 
/db_xref= n GI : 4007714 " 

/translation="MNNQIIGDRDTRLPPLSPEDLAALRDNLREQRLFREEQ" 
4022 a 11280 c 10476 g 3622 t 



gatcacctcg 
ggtgcggtcc 
cctcagcagc 
ccgtgcggcg 
gaggccgtac 
cgacgccgcc 
gcgggccgac 
ccgggcgaag 
ctcgaagtag 
cgaccacagc 
gtccagcacg 
ctacgacctg 
ccaccgtgag 
ctcgagccac 
gctggtggcc 
cacgctggtg 
gcggcggacc 
gggctggggg 
ctccgggtac 
gttcatggcc 
catcgaggag 
catgcacctg 
cggcttcatg 
gcacttcggc 
cgacagcgag 
cttcggcccg 
cctgtggctg 
gggcgccaac 



tccacgtcgg 
ccggccaccg 
gccgccgcct 
gccagcacct 
cgtccccggt 
acccacagtg 
gccacgctga 
ccctcgaagg 
gcgaggtggg 
gcgtcgccca 
agcgggccgg 
tccggcgggg 
gtgacgatct 
gggcccgacg 
tacgccatcg 
gtcggtctca 
gagcgccggg 
acgctgatcg 
tacgaggtcg 
gccgccgccg 
cacatcggca 
ctcaacgagg 
ctcgccgccg 
tggaacttcg 
ggcctgctgg 
gagggcagcg 
gcgcaccggc 
tccgccgcta 



cggccagcgc 
tgtacgacag 
cgtcggtgtc 
ccaggcgggg 
actcggaacc 
cgcgctccgg 
gcgcgagcag 
tggcctggta 
cccgcgtgat 
ggtcgtgccc 
aggcgagggc 
ccacgggccc 
ccggccgccg 
aaggagttcc 
gcggcatcgc 
cgtcggtggc 
aggccctgga 
gcttcggact 
acggactcgg 
cgacggagga 
cctacctcgc 
acgccaccct 
cgtacgccgc 
ccgcgggcgg 
acgccaccat 
tgtactcggt 
gcgggaacat 
cacttccccg 



gaaggcctcg 
ccaggcgggc 
cgggaccgtc 
gcggtggaag 
gtccgcgagc 
gcgggccgtc 
ttcggcggcc 
gctggaggtg 
cgcctcgggg 
ggccgcctcc 
gtcggcgaag 
ttgtcccccg 
gatccgtagc 
cgtgaggttc 
cgtccaggcg 
gctggtggtg 
cgtggccctg 
gttcggggcc 
ctcggtccag 
ggtcgtgttc 
gctgggcctg 
gtggggcgcc 
cacccgcaac 
cgtcttcagc 
gtcgggtccg 
gggcttcggg 
cgtggccttc 
gtgatcgatc 



tccagcggct 
acgtccagtc 
tccagcgcga 
cgctccagtg 
atcgctccgt 
cgcgcccggc 
cgctcccgcc 
atcgccacct 
tcctcggcga 
agctggttgg 
tcgctggtca 
agcgagctac 
gtccggtgca 
gtgtggcagt 
gtgaaggaca 
ttcgtgtacg 
gacggggccg 
gtcatcacga 
ggtgcgatcg 
cgcggggtcc 
accggcctcg 
ctcgccatcg 
ctgtggctga 
accgtcgtct 
aagctgctca 
gtcctgctga 
ggctcgcggc 
gacgacgggt 



gcccggcgcg 
cgcgcaccgc 
ggacgtcggg 
cgcccctgcc 
acgggcccgc 
gggcggcctc 
cgatcccgcg 
cggcgcccgc 
gcagccgggc 
acagcccgcc 
tggcacgagg 
agacaggtgt 
gcccgcggca 
tcctggccgt 
acgactggct 
cctgggtggt 
cggccaaggc 
acctcttcgc 
ggctggtcgg 
tgttccggat 
tgttcggcct 
ccatcgaggc 
cgatcggcgt 
ccggcaacgg 
ccggcggcga 
ccctggtgtt 
gccgtgccgc 
cctggagctg 
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1681 
1741 
1801 
1861 
1921 
1981 
2041 
2101 
2161 
2221 
2281 
2341 
2401 
2461 
2521 
2581 
2641 
2701 
2761 
2821 
2881 
2941 
3001 
3061 
3121 
3181 
3241 
3301 
3361 
3421 
3481 
3541 
3601 
3661 
3721 
3781 
3841 
3901 
3961 
4021 
4081 
4141 
4201 
4261 
4321 
4381 
4441 
4501 
4561 
4621 
4681 
4741 
4801 
4861 
4921 
4981 
5041 
5101 
5161 
5221 
5281 
5341 
5401 
5461 
5521 



tggcgtcgac 
tcgctgctgc 
gccgacgcgc 
cggtggacgc 
gcctaccacc 
caggagaagt 
gtcgggctga 
gtcctcgccc 
gaacgccgca 
gagttgcacg 
cggtacctga 
ggccggcggg 
accgccgagc 
cgggccgggc 
tccgacctcg 
cacggcggca 
ggcacggacg 
ggcctgcggg 
ggcggcttcg 
cgccgatccg 
tcgtcgacgc 
tcgacctggc 
tcgacggcct 
tgctcgtggt 
ccagcggctt 
tggccatggg 
cggcccggat 
cggaggtcct 
tgatcagccc 
gcgaccgcgt 
ccgccggccg 
aggttgtacc 
gcaccgaccg 
atcgggaacg 
gccatcgccc 
tccatggcgg 
cggcgcaccc 
cacgccgacg 
ccgatgcccc 
ctcgccatgt 
ttcgcccgga 
tgcgggacat 
ccgacgccgt 
gggacgggac 
gggtcgccga 
agacgcagcc 
tcaccgccga 
ccaaccgcct 
gctttcgcga 
acctgtcgtg 
gccgtgcgcc 
cggcacgacc 
ccgggacgag 
cgacgcggag 
ggcggccgtc 
cgccgcgaag 
ggacacctac 
ccgcgaggcc 
gtccctcgcc 
cctcgcgcac 
gcgggcgctg 
cggtccgacg 
gttctccagc 
cggggcgccg 
caggcgcacg 



tcgacgtcac 
cgtcgctccg 
tggccggggt 
tcctctgcct 
tgttcgcggg 
accggcgtgc 
acgcgcgcgg 
tggcctgggg 
gccgggtcgc 
acgtcgtgac 
ccgccgcgcc 
ccatcaccga 
ccaggacacc 
agccggtgga 
tggcctaccg 
ggacctcggt 
gctccggctc 
agcgggtcga 
tggtccgggc 
ggtcgtgatc 
ccagcccgac 
ccgcgaactg 
ggaggccacc 
gacgacgttc 
cctgctcaag 
cgcggcgctg 
ccgccccgcc 
gcgcctgatc 
cgagaccgtc 
ccaggcggtc 
ctcagccccg 
aggccagcca 
agccggagag 
cccaccagcg 
gccgccgctc 
gcacgtcgtc 
accacacgat 
tgccgttcag 
acaggctgtc 
cacctctcat 
acacgtcgcc 
gaacgacacg 
caccggcatg 
gcccgcccac 
ccacctggtc 
cgaccactgg 
ggatctcgac 
cgacgcgctc 
actggccgcg 
acagcgaccg 
tgggaccacg 
agcctcggtg 
acgctgcggc 
gacgggttca 
ggcgccgtcg 
atcgccgcgg 
tggtcgggcg 
ggtgccgacg 
gcccgcttcg 
ctcgccgacc 
ggcgccgcgc 
cccacgtacg 
gccacgtgca 
aaggcgccgg 
gcgtcggtga 



ggtccgggac 
cggtcagggc 
ggcggccgtc 
caccctggtc 
cgccgcgctc 
cacgcaggtc 
cggcgacgag 
catcggcgcg 
cgaggacgcc 
ccaccacgtg 
cgagcgcctc 
cctgcggcac 
acccgtcggc 
gttcaccgag 
tgtcgtccag 
cctggtgcgg 
gggggccgcg 
cgtactgggc 
ccggataccc 
tgcgacgacc 
ctcgaggtgg 
cggccggacg 
cgcctgctcg 
aacctggacg 
gacgcgcccc 
ctcgaccccg 
gagggcaccg 
gcggacgggc 
aagaccttcg 
gtcttcgcct 
caaccggccg 
gcccaggaac 
gtaccacagg 
gttgcgccgc 
cgggtcccgg 
cggttccgca 
ccagacggcg 
gaccaaccgg 
ccgcctgcgc 
ggggaagacg 
gggacacttc 
cgaaccccca 
gtcgaccacg 
gtcgacggcc 
gaccacctcg 
cacgcctccc 
gaggcccgca 
accgacgagc 
cacctcgcgg 
ccttcgccgc 
cctccgcctt 
tcgcggcggc 
tggccctcgt 
gcgacgaccc 
ggatcaacct 
tccgctcggc 
acggcgacgt 
gcgtcttcgt 
acgtgccact 
tcggcgtacg 
tgcgggcggc 
acgaggtgcg 
gcgcgtcgac 
aggactccag 
ggtcgggctc 



ctcccgctcg 
acggagatcg 
ctccagtcca 
tcgctcggct 
ccgatcgtgc 
accgccaccc 
acgctggtcg 
tggatgcgct 
cgcaacgccg 
acggcgatgg 
gacgagagtc 
ctgctcgacc 
cgggtgctca 
gagggcaccc 
gaggccctga 
cacggggaac 
tcccccggcg 
ggcgagttca 
ggcgggagcg 
aggcactcat 
tcggcgagtg 
tcgtggtgat 
ccggtgccgg 
agtacgtcta 
cggaccggct 
atgtgacgcg 
cgcgggacat 
tgtccaacag 
tctcccgcat 
accgccacgg 
cgcatgtgcg 
acgaccccga 
gccgaggtgc 
agccgctcct 
ggcgcgggcc 
ccgaccgccc 
gccggcaccg 
accacgacgc 
agccgcggac 
tggacgtacg 
ggccggcgcc 
cgagcaccgt 
tcctgcacct 
gcgtctacac 
cggagctgga 
tggtcaccac 
gccgcctcac 
agctcgacca 
agtccgggta 
gctgcaccgc 
cgccctcgcc 
ggccggactg 
cctcggctcg 
cgacgaggtg 
ggaggacggt 
cgcccccggc 
cacggagacg 
ccccgggctc 
caacgtcctg 
ccgcgtcagc 
cgccgacgtc 
ggcgccgggc 
cgccttgtcc 
cgcgcagtac 
ggtgaggccg 



gggtgctgct 
gcggcctgcc 
tccccctggc 
tcgccctcga 
tgatcaacgc 
tcgggtacgt 
agtacgtgac 
ccgcgcgggc 
aacggacccg 
tcgtgcagtc 
tggccgcggt 
tcctcaaccc 
cgctggtcga 
cggcggcggc 
ccaacgccct 
gggagatcac 
gcagcgggcg 
gcaccgaccg 
ggggaagcac 
ccgcaccggg 
cggggacggg 
ggacatccgc 
agtggcgcac 
cgaggcgttg 
gctgcacggc 
ccgcctcgtg 
tccgctgacc 
cgagatcgcc 
cctcaccaaa 
gctggtgacc 
acagacgccg 
ggccgagca t 
cgaagaagat 
ggcgcgagtc 
cgcccttgag 
tggcgtcgcg 
cgcccagcag 
cggccagcag 
ggtcgtcgct 
gtgcagaccg 
gaagcaaccg 
caccagcacc 
ggccgccggc 
cccgcacaag 
ggcccggctg 
ggaagcggac 
ccggctcgcc 
ctcgccgggc 
ctacgccgac 
gccggtgagc 
gggcagggct 
cccgacgggg 
gcccccttcc 
ggggagttcg 
ctcgggcccg 
ctgttcgtca 
ctgcggcgcc 
accgaccccg 
tacacgcccg 
ctcggctcgc 
cgggcgggcc 
tgacctcagg 
caggacgccc 
ccgtggaagg 
taggaccgca 



gctcgtcgcg 
gacccgcccc 
cgtgcgccgg 
ccaactgcgc 
gggctcccac 
ggccatggcc 
gttctatctg 
cgcggaggcc 
catcgcccgc 
cgaggcggcc 
cagcgacacc 
cgaccacggc 
gcagacccgc 
caccggcagc 
caagtacgac 
ggtggaggtc 
ggggctggcg 
tccggcggac 
ggcatgagcg 
ctcgcgacga 
cagaccgggg 
atgccggtgc 
cccgtcaagg 
cgcgcgggcg 
atccgcaccg 
ggccggtacg 
ccccgggaga 
gcggccctcg 
ctcgacctgc 
tgacgcccgg 
gtcgatgcgc 
caggcttccg 
gacggccagc 
gacgaacgcc 
gatctgccgc 
cacgcgccgg 
agccgacagc 
ggccatcact 
tctctctgtg 
gatacccgcg 
gcccatagcg 
cgcacgcccg 
tggacccgct 
gcgatccggc 
gcgggggagg 
cgcgccgcct 
cggatctggg 
gagggctgga 
gcggtgggcg 
ccctgctgct 
tccgcgccgt 
cgtccgcgac 
cgctccccgt 
cgcggcagct 
tcggccggca 
acgcccgcac 
tggaggccta 
cccgcatcgg 
ccggacccgg 
tgctctaccg 
gggacccggg 
cgcgcggcca 
gcacgtcccg 
tgctgcgcag 
gcatgccgta 
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5581 ggtgatctcg gcggtgcggc gcagcgcggc 
5641 gatctgcgtg gcggcgtagc ggccggggtg 
5701 gaaggcggcc agcgcctcct tcccggcccg 
5761 cagctcgccg ccggccagca gcgcgaccct 
5821 cgtgtacagg ctggcgtcct tcaccccgaa 
5881 ctcgaagccc gtctcgtcgg ccaggtcggc 
5941 acccactcgg ggcatgggcc gccttcccgc 
6001 caggacgggc ggcagccgcg gatcacccgc 
6061 ttctcctcga agccgcgcac ggcggcggtc 
6121 cggacacccg cacggtcacc accaccacgg 
6181 acaggtcctg cccgccgcct tcgacaccag 
6241 gcagtcgcgc cggggcctgc tgcgccgtgc 
6301 taccgtcctc gccccggccg ccaccgccac 
6361 ccgccgcggg aagggcttcc tctggctggc 
6421 cagcgacggc aagtaccgcg tcgtcgacca 
6481 ctggctggtc atcaccgacc acggcagcaa 
6541 caaccccgac atccgcgagg cccgcgccgc 
6601 ggagtggaac atcccggccg ccgagcacgg 
6661 ggtctccgtc ctcaagcagt tcgagaccga 
6721 ctccaccccc gccaacgagg ccctcgccat 
6781 caagcggcgc aaggtcaagg acgccctgat 
6841 cgactccccg cacgagatcc gcgcctggcg 
6901 ggtcggcttc gagggcgccc cgggccacca 
6961 gggcggcgcc cgcggcatct acgacggcag 
7021 cctggagagc taccgcacct ggggcggctt 
7081 gtgggacagc ctcatcgccg agggccgccc 
7141 ccaggtctac gccgacaccg gcgcccgcgg 
7201 cgacgacccg gtctacgcgg ggcagatcga 
7261 gtacagccgc acccacgtcg gctccgacgg 
7321 ccgggccggc cgcgtctggg tcgaccacgg 
7381 ctccggcggc ggccgctggg ccacgctcgg 
7441 ggtcaccctc tccatcgacg tcgccctggc 
7501 gaagctggcc cgcgtcgacg tcatccaggg 
7561 caccttcacc gcgccgaccg cccgggtcgt 
7621 caccgtccgc ctcacctacg acctcggccg 
7681 cggcaccgac ggcaaccggt ccgccgtcgg 
7741 ccccgccatc gacgtcgtcg gcgacgccga 
7.801 cccggtctgg gtcctgccct cgtgaccccg 
7861 accgacgcga cgaccctgcg cgaggccgat 
7921 gacctccccg agggcgtctt cggctgcacc 
7981 gccctgtccc tcgccgccga ggcggagccg 
8041 gcccggggcc acgaggtccg cgacggcacc 
8101 cccggtgcgg ccgccctcac cggcaccctg 
8161 gtcgaccggg tgaccgtcct cggcacaccg 
8221 acccggaacc acgtccgccc gcactggcag 
8281 gcggtcggcg gcaccctcgt accgttcgag 
8341 cactgacccg gcccggcgcc gaggggcgcg 
8401 gaacgcgttg gtgaacttcc ccgccgggtc 
84 61 cagccgcggg tacagcgcac gcagctcgcc 
8521 cggacgggcc gcgaagggga cgagcgcctc 
8581 cgccgtgtcc tcgacccagg tgaagtgcgc 
8641 cagccactgc gcgtcggcgg cgaccgtgcg 
8701 cgtctcccgt atcgcgtcca tcgcgtgcag 
8761 gtactccgac tgcaactcgg caccgctgct 
8821 ccgctcgtgc cagggccccg gcaccccgaa 
8881 cggcaccgga tgcatcttct cggcggccgg 
8941 gtcggtgcgc cgcttcagcc acacctgccg 
9001 actcacgctg tacgccgccg ccatcaccgt 
9061 cggcagctcg gtgaagacgt gctgttccat 
9121 cgacgtcacc acgcccagcg cgccgagcga 
9181 ccgcgcgatc accacggtcg aaccgtccgc 
9241 ggccagcgaa ccgttgccca cccccgaacc 
9301 cgagatgtgc ggcagcgagg ccatgttcgg 
9361 ggccagctcc gcgtaccgca caccgccgcc 
9421 gtccaccacg gacggcaggc ccgccagcga 



ggagtcggcg accagggact ggtcgacgcg 
gcgcagggcg tacgcccggt aggcgcccgc 
gccggccacc gctccggcga tctcctcgat 
gacgcgcagg tcctggaggt tgcgcacgtg 
gtggcgggcg agcgcggaga cggtgaccga 
ggccgcggcg acgacgcggt ccgtggtgag 
aacctggagt tcctaggttt catgctaggg 
gagtcgttca acgttcacct tccccgccga 
ttcaggagga acacgcatgg ggcacggaca 
acatacgcac gaccacgagc acgccaccga 
cgtccccgac gaggccctga gccccgcgca 
cggcctgctc ggcgccggac tggccgccgg 
cgccgccccc gcgcgggccg cgagcaacgg 
cggcgaccac cacatccaca cccagtacag 
ggtccgccag ggcgcccggc acggcatgga 
cacccacgcc aagatcggtg tggagaaggt 
ccacgaggac accctcgtct tccagggcct 
caccgtcttc gtccaccccg gcaagcacga 
ctacgacggc agcgtcaagg gcgcgggcga 
cgccggactc tccttcctcg ccgaccaggt 
gctcgccaac cacccggcgc gcaagggcat 
cgacgccacc tcccgcggcc accagatcgc 
ggcgggcggc ctgcccgagc ccctcggccc 
cccgagcgcc aactccttcg ccggctaccc 
cgactggatg accgccaccg tcggcggcct 
ctggtggatc accgccaact ccgactccca 
cggcggcgac ttcaacgcca acggccgcta 
catcacccag aacgacttct ggcccggcca 
cttctcctac gccgccgtca tggacggcat 
gcagctcgtc agcggcctcg acgtccgcgt 
cggcgccctg cacgtccgca ggggcacccg 
cggcggcccc aactgggccg gattcgtgcc 
cgacgtgacc ggaccggccg ccgacaagga 
caagtcctac gaggtcgaca aggaaaccgg 
cgtcgaccgc cccgtctacc tccgcacccg 
ctcgctgggc gccaaggtcg acccggccgg 
cccgtggcgc gacctgtggt tctactccaa 
cccgccgcgc cgtacgtcct cggcatcgac 
cacctgctcc aggcgctggc ggccgaactc 
cacctggtgc gggacggccg cccgpgggtc 
gtcctgcgca ccgcccggga ccggctcacg 
tgggacgaga gcggccgcgc ggtcctcttc 
accctcgccg agctgctcgc ccgctccgcc 
gacgagccga gcccggacac ccggctggtc 
gacggccgcc tcgtgctggc cgccatgccc 
gaccccgacc cgacaccctg ctgcgccgac 
gggtgctcag cccgcgagca ccccgcgcac 
cagcgccccg gccagcgccc cgaagtcggc 
cgccgggacg gtgaacacct tcccccagtg 
ctccagccgc cgcaccaccg gcagcaccgc 
ggccacggtg tcccgcccgt acgccgggct 
gatctcgcag gtctggagca ccggcgcgag 
ggcggccagg gcgtgctccc ggggcatcag 
gggcgtgaac tccgcgcgga agtgcggcag 
ctgctccgtg cagttcaccg cgggcatgcc 
ggccgcgtac gggaaaccgt ccagcggccg 
gaaaccgggc gcccgccagt cggtgaacag 
ctcgaacgtc gccgggtcca acccggccag 
ctcgtacgcc ggctccaggt cgagtgtgag 
ggtcaccgcc ccgccgaacc gctcgtcgcc 
ggtgaccagc tccacctcgc gcaccaccga 
gtgggtgccg gtggccaccg acccggcgac 
cagcgccagg ccccgcgcgt gcaccacccg 
gacccgcacc gtacgggccg ccgtgtccac 
cagcaggaca cccccgtcgc ccggctcggc 
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9481 gatctcgttg aaggagtgcc cgctgcccag 
9541 ggcccgcagc gcgtccaggg agtgcggccg 
9601 gccggcccag ttggtgaccg tgatgtcgct 
9661 gagggccccc gcgcgggaag cctctcccgc 
9721 acccgggagg agacacggga tggacagccg 
9781 gttcccgcac gtaccgcgcg aggccgtctt 
9841 cttcgacccc tccgcactca gtgataccac 
9901 gtcgtacttc atcttctcct tcgaccacgg 
9961 gcgccgcccg cccgaggaga tcatcctcac 
10021 cgtctcggtg cgcgtgaacc cggcctcgcc 
10081 cctcggcctc tacctcgacg gcaagcggat 
10141 gtactaccgg cacaagctct ccaacgggaa 
10201 gtggggctac ctgatctacc tgaccgcgtt 
10261 ggagtgccag tactgcgaca tcaaccacaa 
10321 gtacaccggt gtgaaggacg tcgacgaggt 
10381 cgacaccgcc aagatctcca ccgcgtacac 
10441 ccagggtctg gacgaggccg acttctacgg 
10501 ccccggccgc tggatcggca aggtcgtcgc 
10561 cttcaaggac tacggcgtgc agatctacca 
10621 gttcaagatg tactgccccg gcaaggagcg 
10681 catcctggac tccaccgagg tcttcggcgc 
10741 cgtcgagatg gccgagccct tcggcttcaa 
10801 cgagggcctg cgcttcttca tgtcgcacgg 
10861 cgagcccacc accccgctcg gcaagaccaa 
10921 ccgcctgctc caggcctacc gccagaccat 
10981 ctacggcccg cccggcgcag gcaacgcggt 
11041 gccggaggac gcgcccgtgg aggtgtgagc 
11101 gccttcgggc gcgcgactag ggtgaccgat 
11161 aggggtcttt tccgaggagg tgtcgaccaa 
11221 tccaggagtc catcgagtcc acctccgcgg 
11281 tctcgcgccg gggccgcacc ccgaagggtc 
11341 accactccgc cctgtcctcc tacggctggg 
11401 acgccgccga agggctgctg cccggccggg 
114 61 tcgtcaccgc ggacggcgtc ctgcgcgccg 
11521 tgcgcgtcgt ctgcaccggc gactgggtcg 
11581 tacgcacgta tctgccccgc cgtaccgcct 
11641 aggggcagat cctcgcggcc aacgtcgacc 
11701 agctggacct cgcccggatc gaacggttcc 
11761 ccctggtcgt cctcaccaag gccgacctcg 
11821 tccaggacgt ggagaccgcc gcgcccggcg 
11881 gcgaggggct cgacgtgctc gccgccgtcg 
11941 agtccggcgc cggcaagtcc accctcgcca 
12001 tgcaggccat ccgcgacgtc gacggcaagg 
12061 tcgccctgcc cggcgggggc gtgctgatcg 
12121 tcgacgcggg caacggggtg gaccaggtgt 
12181 gccggttcca cgactgcgcc cacgagagcg 
12241 acagcggtgc actgcccgag cggcggctgg 
12301 agcgcatcgt cgccaagacc gacgcccggg 
12361 gccggggcgc catcggcaag gcggcgatgg 
12421 ccggccgctc caacgcgatg tccgatttcc 
12481 tggacagcgt gacaccacag acagtgcagc 
12541 tccggtacga ggcagtgcgg agccgggacg 
12601 tcgagacgac cggcatctac tgccggccca 
12661 acgtgcggtt cttcgcgacg gccgccgccg 
12721 ggtgccgccc cgacgccgtg ccgggctccg 
12781 gccgggccat gcggctgatc ggcgacggcg 
12841 ccggacggct cggctacagc gcccgccagg 
12901 ccggacccgt ggccctcgcc cgcgcccagc 
12961 ccaccgtcct gccggtcacc gagatcgcct 
13021 tcaacgacac gatccgggcc gtgtacgcgg 
13081 cggcccgcga ccgggccgcc cggcgcacgg 
13141 tggcccaccg cggcccctac caggccggcc 
13201 tcaccggcgt cgaggaggtg agcggcgaga 
13261 gcctgccgta cggcaccggg atcgtcgccg 
13321 gcagcggggg ctggctggag gcccgcctcc 



cacccgcacc ctggcgctgt ccgccaccag 
cagcagttcc ttcgccgtgt acgtgatgtt 
catagcgcgg aaacctaccc ggcgcgtctg 
gggggcatac cgtgagaagt cgaacgccgg 
taccgcgctg gtcgaggatc tgatggagcg 
caaggaggac ctgctgcgcg gcggtgtcgc 
caacgaggca gccggcgagg tcaagccgaa 
caccctgccc gagttgggcg aggccgccct 
cggcggcccc tacgacctgc gccgcaccgt 
ctaccgggtc gccgcgaacg aggacggcgt 
cgccgacgtc ggcgtgccgc ccatgcccga 
gtcggtcatg gaggtggccc cgaccatcca 
ccgcgtctgc cagtacttcg gcgccaagga 
ctggcgccag cacaaggcgg ccgggcggcc 
cctcgaagcg ctggagatca tcgacaagta 
gctcaccggc ggcgcgatca cgtcgaaggt 
ccggtacgcc aaggccatcg aggagcactt 
ccaggccctg cccaagccgg acgtccagcg 
ccccaacttc gaggtgtggg acgagtacct 
ctacgtcggc cgcgacgagt ggcacaagcg 
ccgcaacgtc atccccaact tcgtcgcggg 
gaccgtcgac gaggccatcg agtccaccac 
catcacgccc cgcttcacca cctggtgccc 
ccccgacggc gctccgctgg agtaccacat 
ggaggactac ggtctctcct cgccccccgg 
cttctcggtc agctccttca tggacagcct 
ggacggttcc ggggtttatt cggtggcgcg 
gtccgagcgg cggcccaagc ggccgccgac 
tggctgtcac tgtgacgggc gctgcccgca 
ccaccggctg atctcagccg ctccctgagt 
accccttgac ttccacgtct tcccactcca 
acgactcctg ggccgacgcg ttcgcgccgt 
tcgtgcgggt cgaccgcgga cagtgcgacg 
acaccgcgtt cgtcacgccg cacgacccgc 
ccgtcgaacc gggcggcaac ccgcgctacg 
tcgtgcgctc cacctcgtcc aagcggtccg 
acgccgtcgt cgccgtgtcc ctcgcggtcg 
tcgccctcgc ctgggaatcc ggggcgcagc 
tgcccgaccc ggtgaccctg gcgtacctcg 
tgccggtgct gccggtcagc gccgagcagg 
tctccggcgg taccgccgtg ctgctcgggc 
acgcgctgct cggcgaggcc gccatggacg 
gccggcacac caccaccacc cgcaacctcc 
acacccccgg actgcggggc gtcgggctct 
tcgccgagat cgcggagctg gccgaggagt 
agccgggctg tgccgttctc gccgccatcg 
agagctaccg caagctgatg cgggagaacc 
cgcgggccga gatccgcaag gagtacaagc 
aggccaagcg gggtggcctg aggtagcccg 
gccagcccgg tccgccggtc tggcggagac 
cggcagagca cgcggacgcg cgcgaggacg 
cccggttcga cggcgcgttc ttcttcgccg 
gctgccccgc cgtcacaccg aagcggcgca 
cgcagggctc gggcttccgg gcctgccggc 
ccgactggaa cgtccgcgcc gacgtcgtcg 
tggtggaccg cgagggcgtc gccggactcg 
tgcagcgcca gctcaccgcc gaggtgggcg 
gggcgcacac cgcgcgggtc ctgctgcaga 
tcgcgtccgg gttcgccagc gtgcggcagt 
ccaccccgag cgagctgcgc gccgccgccc 
ccaccccctc cgccggcgtc ccgctgcgcc 
ccgtcttcga cctgctccag cgggaggccg 
ccggccgccg cctctaccgg cgcaccctgc 
tccaggaacg gccgggccgg gcgggcaccg 
acctcaccga cctgcgcgac ctgaccacct 
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13381 ccgtacagcg gctgcggcgg ctgttcgacc 
13441 ggctcggcgc cgacccccgg ctcgcccccc 
13501 cgggcaccgc cgaccccgcg gaactggcgg 
13561 agcggctcgt ccagcggtac ggcaaggcgc 
13621 tcttccccga accggacgtc ctcgccgggg 
13681 ccgccgccct cgccgacggc gccgtacgcc 
13741 agcgcgcgct gctcgccgtg cccggcctgg 
13801 gcgccctcgg cgacccggac gtcgccccgc 
13861 cctggcgctc gtacgcactg aaccacctgc 
13921 gaccacgacc acgccgacga ccacgaccac 
13981 ggtcgacagc cccgtcgggc cgctgctcct 
14041 cctgtccgtg cccgggcaga agggcggccg 
14101 cgggcccttc cgggtggccg aggaacagct 
14161 gttctcgctg ccgctgcgcg cgcagggcac 
14221 ggacgacgtg ccctacggcg cgaccacgac 
14281 gtcgcggccc gcggtacgcg ccgtcggggg 
14341 gcgcccctgc caccgggtga tcggcgccga 
14401 ggaacgcaag acgcggctgc tgtcgctgga 
14461 cccagccacg ccccggtgat cagcaggcag 
14521 ccctggcgca tcgccgtccg cagcgacgcc 
14581 cgctcggcga ggtagatgcc gcccatgaag 
14 641 acgaagccga ggacggcgcc cgcgcccgcg 
14701 cggcgtttcc tgcgggtcgg cagcgcccag 
14761 gtggaaccga ccaggaccgc ccacgcgagc 
14821 accccggccc acaccagcca cgatcccggc 
14881 agcagaatga ccaggccggc cagcaggagg 
14941 agtccggcca ggtccgcaga agtccggccc 
15001 cacccagtcc cgctcgtagg cgtgccagcc 
15061 ctccatcagg cgtttcaccc gtcgctgcac 
15121 cacgctcgcg tcggtcagcc cggcgagcag 
15181 ggggccgtcc ggggcctgct cggtcacgcc 
15241 ccgccacacg gcctcgaaca ggccacacag 
15301 cagcgcggcg ggttccgagg agcgcgcggt 
15361 cacgatcagc ttggtcggca cccggtccac 
15421 ggcggtcagc tcggtgatgc cggtcggcag 
15481 gtagcggaca ccgcgcccgg tggcctgctc 
15541 caccggcttg tcggtgacca gcgcgcacac 
15601 gaagcgctgg gcgacggccc cggcgccggt 
15661 ctcggccgcc gccgcccggt actcctccgc 
15721 cagctcgtgc cgctgctggg tgagcagggc 
15781 ccagcgtccc gggcgggccg aggactgggc 
15841 cgcgcgctcg gtgtcccgtt cgccgagcgc 
15901 cgcgcccacc gacaccagcg cccggtacgc 
15961 tcccagcatc aggtgtcgcc cctccccaga 
16021 cgtggcggga aacggccacg gcgcaaaccc 
16081 cactctgcca aggtggcgtc gccgggtggt 
16141 ccccacctcg gagaactgtt cgacgcgtcc 
16201 gccccgaatt tgccgatgcc ccgtttccat 
16261 gagtcagcag gggggaaggc ccaccggtcc 
16321 gtgggggcag ctgtacgacg actccaccga 
16381 cctggacgac cgcttcgcct cggcgtcgga 
16441 ccccggcgag cccacgcccc cggggccgcc 
16501 gccctggtgg cccaggaccc cgcccgccgc 
16561 cccaccgctc cacgcggacc tgccgggacc 
16621 accgccgccc gccgggggga ccggctcggg 
16681 cccggaggcg ggggacggcg ggcccggcga 
16741 ccggcccacg gctccctggg agcctccgcc 
16801 ggccgtgagc gccgaccccg gcgagggcgc 
16861 aactccgccg ccgggtgaag cgtcgggtcc 
16921 gcctgccgcg ccgaggcggc cggatccgcg 
16981 cgacctcaga gagccgcccg aggcggtccc 
17041 gcttccgccg cgcttcctcc cctcgacgga 
17101 gcccacctac gacgccgagc ccaccgcgct 
17161 cctggtgccg gacaccgtcc tggagggcgc 
17221 ctccgtgcgc ggggactccg cgcgctaccg 



tcgacgccga cccgtacgcc gtcgacgagc 
tggtcgccgc ccgccccggg ctgcgctcac 
tgcgcgccct ggtggggcgc accgaggccg 
tcgacgcgcc ctgcggcacc ctcacccacc 
ccgcgccgca cggcaccccg ggcgcgctcg 
tggaccccgg cgccgaccgg gacgacgccg 
acgcccgtac cgtcgccgtc gtgcgcaccc 
ccggcgccgc cgtccccgac acctggcgcc 
gcgcagcagg agagtgggag aacgaccgat 
gagcatcccc gccgagacgt actggcacga 
caccgccggc tccgacggcg cgctcacctc 
gagcgtccgg gacgggtggc ggcacgacgc 
cggcgcctac ttcgccgggg agctgacgga 
cgccttccgg gagcgcgtgt gggcggcatt 
gtacggcgag atcgccgcgc gcatcggggc 
tgcgatcggc gcgaatccgc tgctgatcct 
cggctccctg accgggtacg cgggcgggct 
gggcgccccg ctcagccggc cagtaccgct 
gcgaacagct cggtcagcag gtcggcgccg 
atcgcctcgc cgtgccgccc cagccgcagc 
ccggggaccg cgccgagcac ggggaccagg 
tacgccgtca gccggcgcat ggcctcgtcc 
cgcaccgccc gggacagcag cagcacgacg 
ggccgcgggt ccttcagcgc ccaccacagc 
acgccgggca gcaggacgcc gcacagcccg 
tcccacgctc ccatctgtcc agggtgcagg 
ggcgggcggg gcccctcagt ccctcctggc 
cagctggagc cgggtggtga ccccggtcag 
ggtcctgagg cccaggtcga gctgcttggc 
cagggagagg atctccagat cggtgccgtc 
ggaggcaccc agacgcagcg ggagggcgtc 
cagttcgagc agaccgctgg cgtgcaccac 
gagcgggacg agggcgaggg agcggtcggc 
gacccggacc tgctcgtcgc ggcccagcgc 
gtcgaggacc gaccgctcga cgaccacccg 
ctccgcgtcg ttctccatgc cggtcacggc 
ctcctcgctc gcgccgagct ggagctggag 
caccacctcc accaggtcgt gcaccgcggg 
cagcagcgcc gccgccagct ccgcccgctc 
gcccagcgcg acgcccggcg gggccgcgac 
ggcgagcccg ttctgctcca gccggcgcag 
cagccgcctg gccagatccg gtacgtcggc 
cgcctcgtgc gtgtcctcca gccccagcac 
accatgccca cctccacctg cgcggacgtc 
gcctcgggac atcatcgccg taccgccggt 
cctcccgtgg ggggtggggc cgcccggcca 
cccgacacgc cgccgggcgc acgccggtcc 
ccggccccat cggtggacaa tcaggagcat 
cgccggtccc accggacctg aggacgactg 
cgacaccggc cccacggccg cacccgattc 
cacactggcc cgcggaccgg cggcccccgg 
accgtccccc accgcgcccg aaccgggacg 
cgtccccggc ccccgcaccg gggacgacgc 
ggacgacggc gacccgaggc acgacgcccc 
gagacgcggg ccgtaccggt ctccgggctc 
gggtgggccc ccgcgggctg ccgggccgtc 
cgcgaccccc gccggcccga ggagctttcc 
gtcggacgcg ccgcggcccg cgcccgccgg 
cgtgtcgtcg ccggcgtccg gtgccgggcg 
tgggccgacc acggccgggt ccggggccgc 
tccgccccgg accctgcccg acctgccgcc 
ggccgaccgc gactacgtcg gcagcgggcc 
gccgcccgcc gatcccgacg gcctcgacgg 
ccggtacggg acgtgcacgc tgcgggccgt 
gggcgagccg cgtcgcgacg cgctgctcgt 
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17281 cgcccggttc ggcgccggag agcaggcgct 
17341 tgccaccgcc ggggcgcacc gggcggcggc 
17401 gggccgcagc cacgcccggc tcgccgagga 
174 61 gtccggcctg caccgcctca ccgaccgcag 
17521 acagggcctg gcccccgacg agtacgcggc 
17581 ccccgggtgc cgtacgcggg tgttcttcgg 
17 641 cgacggcgcc tggcaggaca tggagccgga 
17701 cggcttcggc tccgcacccc gcgacgcacc 
17761 cgaccggctg accatggacc tcggcatcac 
17821 cgccgggccg ccccgcgagc ccttccggtt 
17881 gctcctgatg tgcaccgcgg gcctcgccga 
17941 actcctcgcc cggcgatggg ccgcccgccc 
18001 cagcggcgtg cgggtcaagg gatacgcgga 
18061 gtgagaccgc gcgctgtggc atagctggac 
18121 tcgtcgacat cctcacccgc gccggggtcg 
18181 tcaaccccgt cgtggacgcc gtgcgccgcc 
18241 acgaggagac cgccgccttc gccgccgggg 
18301 cgtgcgccgg ctcctgcggc cccggcaacc 
18361 accgctccat ggcccccgtc ctcgccctgg 
18421 tcggcttctt ccaggagacc caccccgacc 
18481 agctgatctc cagcccgaag cagatgcccc 
18541 tcggccaggg cggcgtcagc gtcgtctcgc 
18601 cgcagggggc cgccgagacg gccctcgtca 
18661 aggagatcga ccggctggtc cggatgatcg 
18721 gcagcggcac cgcgggcgcg cacgccgagg 
18781 cggtcgggca cgccctgcgg ggcaaggagt 
18841 gcatgagcgg actgctcggc tacggcgccg 
18901 tgctcctgat cggcaccgac ttcccgtaca 
18961 cccagatcga cgtgcggccc gagcacctgg 
19021 ggggcgacgc gcgggagacg ctgcgctgcc 
19081 gccgcttcct ggaccggatg ctgaagaagc 
19141 cgtacacccg caaggtcgac aagcacgtcc 
19201 tggacgagat ggccgacgac gacgcggtgt 
19261 gggcggctcg ttacatctcg cccaacgggc 
19321 gttcgatggc gaacgcgctg ccgatggcga 
19381 aggtcgtgtc gatgtccggc gacggcgggt 
19441 tcgtccagca cgacctgccg gtgaagatcg 
19501 tcgagttgga gatgctcgtc gcgggcctgc 
19561 acttcgccgc cgtcgccgag gcctgcggcg 
19621 acctggccgg ggcgctgaag gcggcgttca 
19681 tcaccgaccc caacgccctg tccatcccgc 
19741 tcgccctgtc cgcctcgaag atcgtcctgg 
19801 cccggtcgaa cctgcgcaac gtcccccgcc 
19861 ggatgactgg cgagggctcg aacgggcaag 
19921 gggacggacg gggggagacg acatcagcgc 
19981 gccacagctc aggcggcggc tcggccgggc 
20041 ggcgctgcgt gagctgctgc ggcactgggg 
20101 gctcgccagc gagctggtca ccaacgcgct 
20161 cgcgacggtc gggccgcgcg ccctgcgggt 
20221 gcagccccgg ccgcgcgcac cgcagcggga 
20281 ggtcgagtcg ctcgccgacg actggggcgt 
20341 gttcgaactg ggcgcgggag cggaagcgga 
20401 cggcgcctca ggccatcggt atcggatacg 
20461 ccaggtcctt gagcttgcgc tccagggagt 
20521 ccgcggtgag gtcgacggtc cccacggaca 
20581 gggagggccg ccggctcgcc gcgggcagtt 
20641 cggcgcgttc cgagcctcgc tcgacctgct 
20701 gcccggggaa gccgcggtgg ccgtggcccc 
20761 gccgctcctg ctcgcgccgc cggagcttct 
20821 cctcctcgac cgcctcgtcc aggctgcgca 
20881 ggtacgtctc gcggggcgcg cgcagccagc 
20941 gcaccagacc ctgctcgcgc agggcggccc 
21001 gcaccgccgc cgacagggac atgcccgcga 
21061 cgcgcggcgc gtgcacccag ttgaaccagg 
21121 tccgcgagcc gagcgccgcg tcaccgtggc 



ggtcctggtg gcgatggcga ccggcgcccg 
cgaggtgtgc cggtggatcg gacgggccgt 
cctgcgggcc gcccggcgcg gcgacctgaa 
cctcggccgg ctccgggccg gtgctgccga 
caccctgcgc tgcctgctgc tgcccgccga 
cgtcggcgcg ggcggactgc tcagaccgcg 
cgccggtgac gtcacgggag agcccgtcct 
ccgcgacgca ccgcgcgaga cgcccgagga 
cacatccccg ggcccgtacg aggggccgcc 
ccgggcctcc gtagcccgcc cgggtgatgt 
gccactgctg agcgagcccg gactgggcga 
ggcgcccgga ccgggggagt tcctcgccga 
cgaccgtacc gccgccgccg tttgggaggc 
ccatggccaa acagaacgtc gcggaacagt 
agcgcctcta cggagtcgtc ggggacagcc 
attccggcat cgaatgggtg cacgtacggc 
cggaggcgca gatcaccggg aagctcaccg 
tccacctcat caacgggctc tacgacgccc 
cctcccagat cccgtccagc gagatcgggc 
agctgttccg cgagtgcagt cactacagcg 
ggctgctcca gaccgccatc caacacgccg 
tgcccggcga catcgcggac gagcccgccc 
cctcccggcc caccgtccgg cccggcgacg 
acgacgccga caaggtcacc ctgttctgcg 
tcatggagtt cgccgggaag ctcaaggcac 
tcatccagta cgacaacccg tacgacgtcg 
cctacgaggc cacccacgag tgcgacctgc 
acgccttcct cccggacgac gtgaagatcg 
gacggcgctc caagctggac ctcgcggtgt 
tgatcccccg ggtcaaggag aagaagaacc 
acgccgacgc cctggaaggc gtcgtcaagg 
cgatccaccc cgagtacgtg gccgccctgc 
tcaccgtcga caccgggatg tgcaacgtct 
gccgccgcat catcggttcc ttctcccacg 
tcggcgccca gttcaccgac cggcgccgac 
tcaccatgct gatgggcgac ttcctcaccc 
tgctcttcaa caactcctcg ctcggcatgg 
cctcgcacgg ggtggccaac aagaacccgg 
ccttcggcgt ccgggtggag aagcccaagg 
agcacaaggg tcccgccctc gtcgacgtgg 
cgaagatcag cgccgacatg gtcaccggct 
acggcggggt cggacgcatg ctccagatgg 
cgtgaccgta cggacagccg gttcgaaacc 
gctttccccg tgaagttgtt cgaccggagc 
gccgacgggg gacatgaagg agcaggcggt 
ggacctgcgg gcggtgcccg aggcccggcg 
cgggcccggc cagtcggagg tcgcggagct 
cgtgcacacc gacgagggcg cggtcctgac 
cgaggtacgg gacttcgtgg gccggggccg 
ggagagcacc aacggcagag ggctggtcct 
acagccctgc gaggtgggca agtcggtctg 
agcggaagcg gcctgaggcg ccgggaagac 
gggcgcgtcc ggctcagccg aactgctgct 
ccaggcgggg cagggcctgg gtgtcgtcct 
tctgctcaga cccgctccgc acgggctgca 
gctccggggt cgatatggca ggctccgcgg 
gcacctcgac ctcgacctga cggccgccgc 
ggctgatcgc cttgagctgc gcccgctcca 
tctcctcctt ctggcgccgg tcgtcgcgca 
cgccctccag gagcatcagc gaccaggcgc 
ggaccatgcg gatctgcggc agcggacgcg 
ggcgggtctg cttcagggcc cggtcgaaca 
agaagtgcgg ggcacccgcg tgccccaggc 
ccgcggcgaa ggcgaaggcc cacacgagta 
tggcctcgcg caccgcgagg acggagcaga 
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21181 acatcgccgc cccgtcgagc ccgaacggga 
21241 ggttctgctc gccgaagccg accagaccgt 
21301 agcagaagag caggacgtag gagaaggtgc 
21361 cctccatgcg ctcccacgag tcgtccgcgc 
21421 gcgcgagcac cgccaccgcc gccagcatgc 
21481 agttcagcga tatgtcggtc aatctcatct 
21541 gcccgccatg gtggcccaaa cccagcggcc 
21601 agggagtgca gagggggtgc ccagggcggc 
21661 gagttgagtt cgattaagac tacccgtacg 
21721 tgggggagtt gtggaacaac gcggacgatc 
21781 gagcagcttg tccacgcggt cggcgtcgca 
21841 ggggcgcagg gtgtagaaca tgcagcagct 
21901 gtccggcccc ttcagttcac ggaaggccgc 
21961 cagcagcagc tccagctcgt gccgcgcccg 
22021 ccacaggccc tcgacgacct cgtcggtcgc 
22081 catccggggg ccgaatccgg ccagcaccgg 
22141 ctcggcgcgc agcgcctcct cgtcggcgac 
22201 gtcgccgggg aggcaggcga aggacgcggc 
22261 ggcggccgtg cggtcgtagg acacgtgggc 
22321 gaaccacggc acggtgatca gcaggcaggc 
22381 cgcgatcacg tccggccggg cccgctgccc 
224 41 ggcgaggaag gccgccagct ccgtcccgcc 
22501 gccgccgctc gggacctcgt cggccgcggc 
22561 ctcggacagg cgcgtgtagg cgtccgtgag 
22621 ccgatccacg tcggaacaac gggctgctta 
22681 gggatttgaa ctgacggccc ctgcgcttat 
22741 taatgacccc aagtaccgac acgtcccaca 
22801 gccgcaagaa cgggtgtggg tacgggtacg 
22861 gtgcagccgg gagcggcgga cagggcacgg 
22921 ggcccggcac gcggcgagca cacccacggc 
22981 gtccagcgtt cctccgtgcg cggacagatc 
23041 ggtgaactgc ggcccggcga ggtctactcg 
23101 tcggccaccc ccgtgcggga ggccatgcag 
23161 gtccccaacc ggggcttccg cgtcctcgaa 
23221 gaggtccggg ccctgatcga ggtcccggtg 
23281 gagcactggg cggagctgcg cccgctcgcc 
23341 tgcccggcca cctacgccga ggccgaccgc 
23401 ggcaacgaac agctcgtccg gatcgccggt 
23461 gccggctcgc cctccgtgcg cggccgggcc 
23521 gcgctgctgg acgcgctgat cgccggcgac 
23581 cacttcacag tcacgtcgtg agccggaggc 
23641 gacggggccc ccggaccgga cgtgccccgg 
23701 gggcttcgcc cggatcggcg ttcccgggcc 
23761 cgctcggacg gggcgggcca cggaccgcgc 
23821 ggcccggacc ttcctgcccg cccctgctag 
23881 ggcacctgct tccccggctt ctcggacttc 
23941 cgcccggggc gggaaccggt gtcgcggttg 
24001 aagaccgacg agtacgtgcc gacccccacg 
24061 agcgagtcgt cggccaggat cgccagcgag 
24121 ttgaccgtgc gcggcagggt ctgcaggatc 
24181 ttccgctcct tgcccagcag ttcccggatg 
24241 gagtagccga tcacggtcag cagcgccgcc 
24301 caggcgaaca caccgaccag aatgaccacg 
24361 ccgaacagca ggcggaaccg gaccgccagg 
24421 agggcgatca gggcgtcccg gcgcagctcc 
24481 cggaccttct ccgtctcgcc gcccagttcg 
24541 gtgtcggtca gctcctccgt gcgcaccgtg 
24 601 gcgcgcggga agcccgcgtc ggccagggcg 
24 661 gcggtggagt actcgatgag ccgcccgccg 
24721 accaggatgc cggagcccgc caccacaagg 
24781 ggccggcgca tcaggaacgg gtcgcggcgc 
24841 gagatgccgg tgatacgggg gcggcggaag 
24 901 agcacccggg tgatcaccag ggcgctgacc 
24 961 acgccgaagc ccttgaccgg gccggaggcc 
25021 gtggtgatgt tggagtcggc gatcgcgctg 



ccaggtactg ccagccgtcg ctcaggccga 
ggaaggagag cgccgcggcg accgcggcac 
cgtagagggc ctccttgcgc cggcggcgct 
ccgcgtcctt ccccgaggaa cgcttgccgc 
ccaggagcag cacggcgccc ggaagcagcc 
ggggtccctt gcagtgggat agggcgtaac 
ctcagggggt ttcggggcaa gaggccgcca 
gttctgctcg aactgccgcg tgaggggcgg 
ggtggttcca cggaaagttc ccgcgacgtg 
ttagccgacg gcccgtcagc ccgccgcggc 
ggtccgcggg caggtggcgc aggtgtcctc 
cgcccggtcc cgggtgtgca gcggggcgcc 
cttcccgacg tacggcctgg tggcgcccgg 
ctcctgctcg ccgagcaggt gggcgacgta 
catcccccac agggcacgtc cgcgccgccg 
ctccaggtgc tcggcgaccg cggcccgcac 
cacccgggcg ccgggcagcg cggcggcggg 
acgcaccgcc atccggccga gggggagccc 
cgcggggtgg cggggcaccc ggcggtgcag 
gggccacgcg taccggtgca ggccgaagct 
gtagtcccgc agcacctggg cctcgtccca 
cgcggcgagg gaggcggccg tgacccagcc 
cagttcggtc acgcccagcc cggggaggac 
ggccgtacgg gcgacaggca tgcgggacca 
aaggtaagcc ttaccttacc caatatgttc 
ggtgcttgac ggacaggtga caacccgccg 
ggaggacccg tgaagcacag cgcgcagggc 
ggggcggcgg ccgaagcggt gcggatcccg 
ggccgggccc ccggcggtgc ggagccggac 
gaaccgccgg tgccccgccc gcgggccctg 
ctcgacgccc tgcgctccgc gctggtcacc 
gcaccggtgc tcggcgagcg gttcggcgtc 
cagctcgccc tggagggcgc cgtcgaggtc 
cgcggtgacc gcgagctggc cgaactggcc 
tggctccggc tggcccgcac ggtgcccgcg 
gagggcaccg tccgggccgc gtcctccggc 
gccttccacc gcgccgcgct cgcgctggcc 
gacgtgcacc gccgcgccca gtggccgccg 
gacctggtcg ccgacgcgca ccagcacacc 
ttggacgtgg tgcgggccct ggtcggggac 
ccggcagggc gcccttcacc cgggcgcccg 
gtgggacgtg ccccgggccg gtcgtccgcc 
ggggtcccgg gtcggcgttc ccgggccggg 
gccccgggcc cgggcgccct tgagcgaagg 
cggacctcct gccgttccgt gcggctcgtg 
cccgtggcgc gggtccggcc gcccctgcgg 
tgcagctcga tggccagggg agaggcggtg 
ccgatcagca gcgcgagcgc gaagtcggtg 
gccaggatga gcaccgcgcc catgccggtg 
gcgtcgttgg tgagccggtc gaacggtgcc 
cggtcgaaga gcaccaccga gtcgttcacc 
aggaacaccc cgtcgatcgg cttgcccagc 
tcgtgggcga gcgcgccgac cgccgccgtg 
tacgccagct gcgcggccag cgccaggccg 
tcgccgaggc tcgggccgat cagctcgtcg 
gcgacggcct tcgtgacggt cgccgcctcg 
aggtcgccgt cgccggagga ctggacgacg 
tcccgcgccc ggtcgggatc gacctgggtg 
gtgaactcga tgccgaagtt caggccccgc 
acgatcagcg aggccgccag ccagcggcgc 
agcagggcgt cacggaccgg gccggtcgag 
accgccgggc ggctcgcggc gaactcggcg 
atcgaggcca gcacaccgat gccgagggtg 
aggaagaaca gcagggcggc ggcgatcagc 
aaggcgctgc ggaaaccggc ggtcagcgcg 
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25081 gagcgcgtac tgggacgggt ccgggccgcc 
25141 acgttggcgt ccaccgccat gccgatcgcc 
25201 agggtcgcgc cgaccgcggc cagggcggcg 
25261 acggtcgcga gggcgcccat cagccggtag 
25321 gtgccgatga cggcggccca ggcgccggcg 
25381 atggtgcgct gctcgatggt ctcgaccggc 
25441 gccagttcgc gcgcctcggc gtcgtcgaag 
25501 atgcccgccc cgcaggacac ggaggggtcg 
25561 agcacgatgg cgacccggcg cgtggggtca 
25621 acctgcgccc agcggtcgct gccggagtcc 
25681 ccgttctgct ggtcgaagcg ggcgtcggcg 
25741 gccttcagcc gcagggactg cccggactcg 
25801 gtctcggtcc tgccctcggt cgcgtcggcg 
25861 acggtgagct gcgcggtgcg gccgagcacg 
25921 ggcagctcga cgacgacccg gttctcgccg 
25981 agggcgtcga tgcggccgcg cagcacctcc 
26041 ttggtggtct cggtcgactt ggtctccagg 
26101 cccaggcgga ccggcatggt cagggcgatg 
26161 acgagggcgc ggacgttcag ggaacgggac 
26221 cgggagcgtt tcaagacggg cctccggcgg 
26281 cggcggaaga agatgcaggg aaggagtggt 
26341 ccggccgctg tcgtggctcg cgcggccggg 
26401 cgtccgcgcg tggccgcagg gggcggcgcc 
264 61 ctggtccggt gccgtcggac gcccggccgg 
26521 gcagaccggc gtgcagccgt cgtcggcgtg 
26581 ggcggcggcc gccgcgaggg gaccgctcgg 
26641 cgggaccagg ggcagcagga cggtcagcag 
26701 catgcaccgt cccctttcct ggtggatgag 
26761 ccggccgggg agtgggcggc cagagccacg 
26821 agcccggcgc ggatggcgta ccgggtcagc 
26881 agcaggttcg cccggtgccg ctggacggtc 
26941 tccttcgagg agtggccctc ggcgacgagc 
27001 atgtggtcgg aggtctcctc gccgtgccgc 
27061 gtcaccgcgc ccgggtacag gaacggctcg 
27121 aggtcccggt cggcgaccga cttgagcacg 
27181 aacaggtact gctcgttgtc gtgcatcgtc 
27241 gcggccagtt cgcgggccgc ctggaggccg 
27301 gcgaggtcgg cctcgtgggc ccgcgccatg 
27361 gccaccacct ccaggtccgg ctcccggtcg 
27421 gcgtggtcgt cggcgagcaa gatgcggatc 
27481 tacgcggacg tcgtggacgc ggacgtgggc 
27541 cggccgcgcg cggacggctc ggacggctcg 
27601 acttcctggg aaggggtgcg gtgagccgga 
27661 tgtcgagggc ggccccgatg agcagggccc 
27721 cgtgcgccgc ctcgatgccg cgcccgtcgt 
27781 cgtgggccag gcccacctcc aggcgctcgg 
27841 tctcctgagc cacccggtag aggaccagtt 
27901 cgtcgaaacg gcgcaccacc cgcagaccgg 
27961 acagcgcgct gatcagaccg aggtcgtcca 
28021 ggcggacctc gtccaggctc tcccgggtga 
28081 gcggctcctc ggcgtcgtcc gcggcacgcc 
28141 gcccgacctc gtcgtgcagc tcctgggcga 
28201 ggacgcgggc gctgctggtg gcccgctcgt 
28261 tgcggatcag ctccgggacc tcgccgccgc 
28321 gcaggtcgac ggtggtcatc agcctggtga 
28381 gcagggcacc gttggccacc agcatgacgc 
28441 tcagcaggat cggcacggag acggtcaccg 
28501 ccagcaccac cgcgttgagc ccgaagatcc 
28561 cttcggcgac gtgttgattt ggtacgcctc 
28621 agctgcggcc caggtagcga ccgagcaagc 
28681 tcgatggggc ccgaccccca tttccggccc 
28741 ccgccgagtg gatcccgcac ccccgggcaa 
28801 gcggcccggc cagggttgag gacgaccggg 
28861 agcaccttcc gagaaaagga ccactcatcg 
28921 gccccgagcg gctgaccgct cacgaggcgc 



tgttcctcgc gggcccgctc gaagaccagg 
agcacgaagc ccgccaggcc cggcagggtc 
taggagatca ggccgtagca gagcagggcc 
acgacgatga tgaacagcgc ggtcagcgcg 
tcgatggcct cgtcgcccag ggtggcgccg 
accggcaggg cgccgccctt gatgagcagg 
gagccggtga tctgcgtgga gccgccggtg 
acctgcggcg aggagatgat cttgtcgtcg 
ccggccgggt tgcaggccgc ctcgccggtc 
ttgaagtcga cggtgacggt ccagccggcg 
cccttgacgt cctgaccggt cagcgtggcg 
tccggcagca cccgctcgcc gtcctcggcg 
ggcttctccg ccgcgccgag caccgagtgg 
tcggcggcct tcttcgggtc ctgtacgccc 
gagcggacga tggtcggctc ggcgacgccg 
acggtgcggt cggtggcctc ccggtcggcc 
acgatctggg tgccgccgcg caggtcgagc 
gccacggccc cggccatcac ggccagcgcg 
cgcgtacggg aacgggagcg ggaaggggga 
gcacgccgcg gcggcggccg agcgcggccg 
gtctcaggtg ccggaggaga ccggtggcgc 
ggagaccggc accgggccgg gcgcgaacga 
gcccccgagg tgggcggtgg tcgccgggtg 
ctggtcgtgc cgggctcgag gctgggcggc 
gaggtcgagg gccgggtgcg gcacggccgt 
accgacgtgc gcaccggccg cgggccaggc 
gacggtgagc acgctcgcga gccgtcgtcg 
cgggcctggc gggtggggga gcccggggct 
ggcggttcgg gggaccgtca gggctcgatg 
tccaggcggt cgcgcaggcc gagcttctgc 
ttgatgctga tgaagaggat ctcggcgatc 
ttgaggacct cctcctcgcg cggggtgagg 
acccggtcga ggtagttgcg gatgagcgcg 
tcgcgcatcg cggcgcggca ggcggcgacg 
tatccgcagg cccccgactt cagcgcctgg 
agcatcagga tgcgcagccc cggtttcagc 
gtgaggcgcg gcatcgcgat gtccaggacg 
tcgatggcct ccgcgccgtc cccggcctcg 
aggatgaggc gcacaccgcg gcgcaccagc 
ttcgaggggg cggacgccgg gggtgtgccg 
gtcgtgggcg tcgtggacgc gccggcgtgg 
ggcagggacg ggtcggacat ggtcggtgct 
tgcgggtgcc ggcgccgggt gccgacgtga 
gttcccgcat gccgcggatg ccggcgccct 
cggcgacggt gagcgtcacc gcggcgtcgg 
cgtccgcgtg gcgggcggcg ttggtcaggc 
cggtctcgtg gtccaggacc ggcagatcgg 
tgtgggtggc gaagtcgtgc gtgagcgagg 
ggacaccggg ccgcagccgg cgcaccaggc 
tctcctgcgc ctggtgcagc tcgtcgcgca 
ccaggaccag caggatcgcg gtcatgctct 
tgcggcgccg ttccgcctcc tgggccagca 
tctccaggcg gtccagcatg gcgttgaagg 
cggagatcgg cagccgctgg ccggggcgca 
gccgctccag cggcgccagg ccccagcgca 
ccatgccgac cacgaggacg acggcctcgg 
gagcccacag cagcagcgcc gtggcgaagc 
gccagaacag ggacaccggg ggtacgcctc 
tactgtcggt catcgcgcac cctcggtggg 
gaccagagtg ctcctcgcgg accggtccgg 
gcggcatacc catatggccg ccccggcgca 
atgggtcctg cgcccgatgg gtttccccgg 
tccgatcggc ccggcactcg gcatacccgc 
tgtcgctcga cgcctcccgc atcgaacccc 
gccagcgcct cgaacacgcc cgcaacaccc 



19 of 20 



3/28/01 3:51 PM 



NCBI Sequence Viewer 



hnp://ww\v.ncbi.nlm.nih^ 



28981 gcgtcaccca gcttcaggcc ctcgccgaga 

29041 cgcagaaggc cgcgatcgag cgggtgctca 

29101 aggagggcac ctacggcgcc tgcctcggct 

29161 agatcctccc ctacacgcgg tactgcgtcg 

29221 ccgtcccacc gtcccaccgt ccccgtccag 

29281 aagtggtgaa caaccagatc atcggcgacc 

29341 aggacctcgc cgcgctccgg gacaacctcc 



gcggccaggc ggacgaccag ctgatgtccg 
aggagatcga cgaggccttc gcccgggtcg 
gcggcaagcc cgtccccggg gagcgcctgg 
cctgccagcg ccgcgcggcc gcctgacgcc 
taccgtccct cccgccctgc ccaaggggtg 
gcgacacgcg tctgccgccc ctgtcgcccg 
gggagcaacg gctgttccgc gaggagcaac 
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